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Q they encode, by expressing the coding polynucleotide sequences in suitable host cells. Also disclosed are isolated polypeptides hav- 
^ ing enzymatic activity of helicobacterial glycosyltransferases. Such polypeptides are particularly useful for screening of modulators 

of their enzymatic activity, in particular enzymatic inhibitors having potential antibacterial activity. 
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GLYCOSYLTRANSFERASES OF HELICOBACTER PYLORI AS A NEW 
TARGET IN PREVENTION AND TREATMENT OF H. PYLORI INFECTIONS 

5 FIELD OF THE INVENTION 

The invention relates to newly identified and isolated polynucleotides and 

polypeptides of bacterial origin, in particular to novel polynucleotides and 

polypeptides related to glycosyltransferases involved in biosynthesis of 
10 lipopolysaccharides of Helicobacter pylori. 

BACKGROUND OF THE INVENTION 

15 Helicobacter pylori is a spiral, microaerophilic, Gram-negative bacterium infecting 
about 50% of the global human population, and is now recognised as the most 
common bacterial pathogen of humans worldwide. It is the causative agent of 
chronic active gastritis in all who harbour it, is responsible for the development of 
most gastro-duodenal ulcers, and is formally recognised as the carcinogen for 

20 certain gastric cancers (Blaser, Gastroenterology 102: 720-727 (1992); 
Parsonnet ef a/, N. Engl. J. Med. 325: 1127-1131 (1991)). H. pylori is a highly 
motile organism and migrates through the superficial mucus layer of the gastric 
lumen to colonize the underlying gastric pits and associated epithelium. The 
precise mechanisms by which H. pylori injures the gastric mucosa to elicit the 

25 aforementioned pathogenic states remains unknown, but it is clear that urease 
production (Eaton ef a/, Infect. Immun. 59: 2470-2475 (1991)) and motility are 
required for gastric colonisation of experimental animals. However, the 
development of gastro-duodenal disease clearly requires additional bacterial 
virulence factors (Phadnis ef a/, Infect Immun. 62:1557-1565 (1994); Tummuru 

30 ef a/, Mol. Microbiol. 18: 867-876 (1995)). Although several bacterial adhesins 
and putative receptors on host epithelium have been described (Evans ef a/, J. 
Bacteriol. 175: 674-683 (1993); Boren et a/, Science 262: 1892-1895 (1993); 
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Odenbreit et al t Gut 37 (Suppl. 1): A1 (1995)), their role in gastric colonization by 
H, pylori has not be n clearly established. 

Gram-negative bacteria, such as H. pylori, have their bacterial cell wall covered 
5 with an outer membraneous layer consisting of lipids, proteins and 
lipopolysaccharides (LPS). LPS contain lipid A, a disaccharide of two 
phosphorylated glucosamine (GlcN) residues with attached fatty acids, and a 
polysaccharide attached to one of the glucosamine residues through a glycosidic 
bond. The polysaccharide is composed of a core of approximately 10 sugar 
10 residues followed by a repeating series of units of 3 to 5 sugars called the O side 
chain (O-chain). The number of repeating units in the O-chain varies from about 
10 to 40. The sugars found in the O-chain vary among bacterial species, whereas 
the composition of the core polysaccharide is relatively constant. 
Lipopolysaccharides are released from bacteria undergoing lysis and are toxic to 
15 animals and humans. They are often referred to as endotoxins. 

While much attention has focused on the role of bacterial and host proteins in H. 
pylori infection and immunity, the role of LPS in these processes has received 
less consideration (Moran, Aliment Pharmacol. Then 10 (suppl): 39-50 (1996); 

20 Yokota et a/, Infect Immun. 66: 3006-301 1 (1998); Wang ef a/, Mol. Microbiol. 31: 
1265-1274 (1999)). As a major cell surface component, this molecule is well 
situated to selectively interact with surface components of the host. In particular, 
LPS could facilitate initial gastric colonisation, be responsible for biological 
interactions which modify the inflammatory response, and promote a chronic 

25 infection. 

Comprehensive, detailed structural analysis of H. pylori LPS has revealed some 
unique features of the molecule which may account for certain aspects of H. 
py/ori-induced pathogenesis (Aspinall et al, Biochemistry 35: 2489-2497; 2498- 
30 2504 (1996); Aspinall et al, Eur. J. Biochem. 248: 592-601 (1997); Monteiro etal f 
J. Biol Chem. 273: 11533-11543 (1998)). In addition, H. pylori LPS, unlike 
typical LPS, has low endotoxic properties. Fresh clinical isolates usually display 
typical smooth type LPS (S-type). The O-chain polysaccharide structure of H. 

2 
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pylori type strain (NCTC1 1637) LPS is composed of a type 2 A/-acetyllactosamine 
(LacNAc) chain of various lengths and this O-chain may be partially a-L- 
fucosylated or less commonly a-D-glucosylated or a-D-galactosylated and may 
be terminated at the nonreducing end by Lewis blood group epitopes which 

5 mimic human cell surface glycoconjugates and glycolipids. However, it remains to 
be formally established if the O-chain of H. pylori LPS contributes to 
pathogenesis or generates protective immunity. For instance, the Lewis antigens 
present on the O-chain polysaccharide might reduce the immunogenicity of this 
molecule during infection, or might trigger autoimmunity. The ability to produce 

10 structurally defined truncated LPS molecules should help elucidate the biological 
role of LPS in H. pylori infection and immunity and possibly open a new approach 
to the treatment and prevention of H. pylori infections. - 

Known methods of prevention and treatment of H. pylori infections are either 
15 immunogenic or drug-based. The immunogenic approach is mostly intended to 
provide an immunogenic protection against the bacterium by vaccinating the 
individual with a usually bacterium-derived immunogen, to elicit an immune 
response of the organism to future H. pylori infections. Among many others, 
immunogens (antigens) derived from the LPS of H. pylori are known in this group 
20 of treatments (see, for example, WO 97/14782 and WO 87/07148). 

According to the second approach, H. pylori infections are treated with 
antibacterial drugs or combinations of such drugs, intended to eradicate the 
bacterial population in the infected individual. In this group of treatments, the 

25 currently most common are so called triple therapies, in which patients are 
administered simultaneously two different antibiotics and an acid secretion 
inhibiting drug. The efficacy of these therapies varies and is often adversely 
affected by the developing resistance to broad spectrum antibiotics used for this 
purpose and by side effects of antibiotic therapies, which frequently result in 

30 termination of the therapy before completely healing the infection. 



In view of the above-indicated deficiencies of the current antibiotic therapies, 
attempts are made to develop more specific drugs against H. pylori, such as 

3 
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drugs modulating the activity of enzymes specific to the bacteria (see, for 
example, US 5,801,013 and US 5,942,409). An ideal anti-helicobacterial drug 
should be selective, meaning that the drug should inhibit H. pylori but not the 
bacterial population of the microflora of the lower intestine. This means that the 

5 molecular target of the drug should be unique to H. pylori and/or should be 
related to its unique phenotypic characteristics, particularly those facilitating the 
colonization of bacterium's natural ecological niche (the human stomach). While 
improving the understanding of H. pylori pathogenesis, the present invention 
provides means for developing new anti-helicobacterial drugs possessing such 

10 desirable characteristics. 



SUMMARY OF THE INVENTION 

In one aspect, the present invention provides isolated and/or recombinant nucleic 
15 acids which encode certain glycosyltransferases of Helicobacter origin. The 
invention also provides recombinant DNA constructs and vectors containing 
polynucleotide sequences encoding such glycosyltransferases or portions 
thereof. These nucleic acids and constructs may be used to produce recombinant 
glycosyltransferases of Helicobacter origin by expressing the polynucleotide 
20 sequences in suitable host cells. 

In another aspect, the invention provides isolated polypeptides having the 
enzymatic activity of glycosyltransferases of Helicobacter origin. Such 
polypeptides are useful, among other things, for the identification of modulators, 
25 in particular inhibitors of their enzymatic activity, which inhibitors are potential 
antimicrobial agents. Using the isolated polypeptides of the present invention, 
potential inhibitors of these enzymes can be screened for antimicrobial or 
antibiotic effects, without culturing pathogenic strains of Helicobacter bacteria, 
such as H. pylori. 

30 

According to one embodiment of the invention, preferred glycosyltransferases of 
Helicobacter origin are glycosyltransferases of H. pylori involved in the 
biosynthesis of the bacterial lipopolysaccharide (LPS), in particular of LPS core or 
LPS O-chain. Disrupting genes of such glycosyltransferases in several strains of 
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H. pylori resulted in mutants unable to complete the structural assembly of LPS 
and having as a result a reduced ability to colonize the murine stomach. 

According to yet another aspect, the present invention provides novel antigens 
and vaccines used in immunization against Helicobacter bacteria, in particular H. 
pylori. The novel antigens are derived from bacteria having deactivated gene of a 
glycosyltransferase involved in the biosynthesis of the bacterial 
lipopolysaccharide, in particular of LPS core or LPS O-chain. Purified or partially 
purified LPS isolated from such mutants is a preferred antigen. 

Other advantages, objects and features of the present invention will be readily 
apparent to those skilled in the art from the following detailed description of 
preferred embodiments in conjunction with the accompanying drawings and 
claims. 

BRIEF DESCRIPTION OF THE DRAWINGS 



Fig. 1 shows amino acid sequence alignment of glycosyltransferases from H. 

20 pylori, H. influenzae, H. somnus and N. meningitidis. Multiple sequence 
alignment was performed using the Clustal Alignment Programme (Higgins et al, 
Gene 73: 237-244 (1988)). Designations on the left side refer to the origin of the 
sequences; HP0826 of genebank AE000594 (Tomb ef a/ T Nature 388:539-547 
(1997)), Haemophilus influenzae lex2B, U05670 (Cope et al, Mol. Microbiol. 5: 

25 1113-1124 (1994)), Haemophilus somnus lob1, U94833 (Inzana et a/. Infect. 
Immun. 65: 4675-4681 (1997)) and Neisseria meningitidis IgtB, AAC44085 
(Jennings ef a/, Mol. Microbiol. 18: 729-740 (1995). Numbers on the right side 
indicate amino acid positions. Gaps introduced to maximise the alignment are 
indicated by dashes. Shadings were obtained using the Genedoc Programme 

30 (www.cris.com/-ketchup/genedoc.shtml). Black indicates 100% identity, dark 
grey indicates 80% identity, and light grey indicates 60% identity. 

Fig. 2 shows a complete FAB-MS spectrum of the methylated intact LPS of 
26695::HP0826kan strain. 

35 
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Fig. 3 is a schematic showing the chemical structure of LPS from parent strains 
26695 and SS1 and isogenic mutants of HP0826, HP0159 and HP0479. 

Fig. 4 shows results of CZE-MS/MS analysis (-Hon mode) of delipidated LPS from 
5 H. pylori 26695::0159 mutant. Tandem mass spectrum of precursor ions at m/z 
902 (doubly protonated ions). Separation conditions: 10 mM ammonium acetate 
containing 5% methanol, pH 9.0, +25 kV. For MS/MS experiments, nitrogen as a 
collision gas, Ei a t>: 70 eV (laboratory frame of reference). 

10 Fig. 5 shows results of CZE-MS/MS (+ion mode) analysis of delipidated LPS from 
H. pylori 0479 mutants. Tandem mass spectrum of precursor ions at m/z 1612. 
Separation conditions: 10 mM ammonium acetate containing 5% methanol, pH 
9.0, +25 kV. For MS/MS experiments, nitrogen as a collision gas, E !a b: 60 eV 
(laboratory frame of reference). 

15 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENTS 

As used herein, the terms "identity" and "similarity" mean the degree of sequence 
20 relatedness between two or more polynucleotide or polypeptide sequences as 
determined by the match between strings of such sequences. "Identity" or 
"similarity" can be readily quantified by algorithms well known to those skilled in 
the art, implemented in a number of publicly available computer software 
packages, for example BLAST software package available from NCBI and other 
25 sources. The identity or similarity is usually expressed as a percentage of identity 
with respect to some reference sequence. For example, in a polynucleotide 
having a sequence 95% identical to a reference nucleotide sequence, 5% of the 
nucleotides of the reference sequence have been deleted or substituted with 
another nucleotide, or 5% of another nucleotides have been inserted into the 
30 reference sequence. These substitutions, insertions, and/or deletions may take 
place anywhere between 5' and 3' terminal positions, either interspersed 
individually among nucleotides of the reference sequence or in one or more 
contiguous groups within the reference sequence. 

6 
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The term "isolated" as used herein means altered by the hand of man with 
respect to its natural state. For a substance occurring in nature, it means that this 
substance has been changed or removed from its natural environment, or both, 
s For example, a polynucleotide or a polypeptide naturally present in a living 
organism is not isolated, but the same polynucleotide or polypeptide separated 
from its natural matrix and coexisting materials is isolated, as the term is 
employed herein, 

10 The term "polynucleotide" or "nucleic acid" refers to any polyribonucleotide or 
polydeoxyribonucleotide, which may be unmodified or modified RNA or DNA, 
whether single- or double-stranded. The term "polypeptide" or "protein" refers to 
any peptide or protein comprising at least two amino acid residues joined to each 
other by peptide bonds or modified peptide bonds. 

15 

The term "variant" as used herein means a polynucleotide or polypeptide that 
differs from a reference polynucleotide or polypeptide but retains its essential 
properties. A typical variant of a polynucleotide differs in nucleotide sequence 
from another, reference polynucleotide. Changes in the nucleotide sequence of 

20 the variant may or may not alter the amino acid sequence of a polypeptide 
encoded by the reference polynucleotide. A typical variant of a polypeptide differs 
in amino acid sequence from another, reference polypeptide. These difference 
are usually limited and variants of a polypeptide are closely similar overall and 
identical in many regions. A variant of a polynucleotide or polypeptide may be 

25 naturally occurring, such as an allelic variant, or may be prepared by 
mutagenesis techniques, by direct synthesis, or by other recombinant methods 
well known to those skilled in the art. 

A "fragment" can be considered as a variant of a polynucleotide or polypeptide, 
30 having the same nucleotide or amino acid sequence as part of the reference 
polynucleotide or peptide. A fragment may be "free-standing" or comprised within 
a larger polynucleotide or polypeptide, normally as a single continuous region. 
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Nucleic acids referred to herein as "recombinant" are nucleic acids which have 
been produced by recombinant DNA methodology, including those nucleic acids 
that are generated by procedures which rely upon a method of artificial 
recombination, such as polymerase chain reaction (PCR) and/or cloning into a 
vector using restriction enzymes. 

According to one aspect, the invention provides novel isolated polynucleotides 
and polypeptides, as described in greater detail below. In particular, the invention 
provides isolated polynucleotides and polypeptides related to 
glycosyltransferases involved in the biosynthesis of bacterial lipopolysaccharides 
of bacteria of the genus Helicobacter, more particularly the lipopolysaccharides of 
the species. Helicobacter pylori and various strains thereof. In a preferred 
embodiment, the glucosyltransferases as those involved in the biosynthesis of 
the bacterial LPS, in particular of LPS core or LPS O-chain. Most particularly, the 
invention provides isolated polynucleotides and polypeptides identical over their 
entire lengths to sequences set out in Table 1 . 



8 
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Table 1 . Polynucleotide and polypeptide sequences 



Sequences from strain 26695 of H. pylori 
5 A. polynucleotide sequence: ORF HP0826 [SEQ ID NO:1] 

ttgcgtgttt ttgccatttc tttaaatcaa aaagtgtgcg atacatttgg tttagttttt 60 
agagacacca caactttact caatagcatc aatgccaccc accaccaagc gcaaattttt 120 
gatgcgattt attctaaaac ttttgaaggc gggttgcacc ccttagtgaa aaagcattta 180 

10 cacccttatt tcatcacgca aaacatcaaa gacatgggga ttacaaccaa tctcatcagt 24 0 
gaggtttcta agttttatta cgctttaaaa taccatgcga agtttatgag cttgggggag 300 
cttgggtgct atgcgagtca ttattccttg tgggaaaaat gcatagaact caatgaagcg 360 
atctgtattt tagaagacga tataaccttg aaagaggatt ttaaagaggg cttggatttt 420 
ttagaaaaac acatccaaga gttaggctat atccgcttga tgcatttatt gtatgatgcc 4 80 

15 agtgtaaaaa gtgagccatt gagccataaa aaccacgaga tacaagagcg tgtggggatc 540 
attaaagctt atagcgaagg ggtggggact caaggctatg tgatcacgcc taagattgcc 600 
aaagtttttt tgaaatgcag ccgaaaatgg gttgttcctg tggatacgat aatggacgct 660 
acttttatcc atggcgtgaa aaatctggtg ttacaacctt ttgtgatcgc tgatgatgag 720 
caaatctcta cgatagcacg aaaagaagaa ccttatagcc ctaaaatcgc cttaatgaga 7 80 

20 gaactccatt ttaaatattt gaaatattgg cagtttgtat aa 822 



B. polypeptide sequence deduced from sequence A [SEQ ID NO:2] 
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Ala Leu Met Arg Glu Leu His Phe Lys Tyr Leu Lys Tyr Trp Gin Phe 
260 265 270 

Val 



C. polynucleotide sequence: ORF HP0159 [SEQ ID NO:3] 

atgagtatta ttattcctat tgtcatcgct tttgataatc actatgccat gccggctggc 60 
gtgagcttgt attccatgct agcttgcgct aaaacagaac acccccaatc acaaaatgat 120 
agtgaaaaac ttttttataa gatccactgc ctggtggata acttaagcct tgaaaaccag 180 
agcaaactaa aagagactct agcccccttt agcgcttttt cgagcctaga atttttagac 240 
atttcaaccc ccaatcttca cgccactcca atagaaccct ctgcgattga taaaatcaat 300 
gaagcttttt tgcaactcaa tatttacgct aagactcgct tttctaaaat ggtcatgtgc 360 
cgcttgtttt tggcttcttt attcccacaa tacgacaaaa tcatcatgtt tgatgcagac 420 
actttgtttt taaacgatgt gagcgagagc tttttcatcc cactagatgg ctattatttt 480 
ggagcggcta aagattttgc ttccgataaa agccctaaac attttcaaat agtgcgagaa 540 
aaagaccctc gtcaagcctt ttccctttat gagcattacc ttaatgaaag cgatatgcaa 600 
atcatctatg aaagcaatta taacgccggg tttttagtcg tgaatttaaa gctgtggcgt 660 
gctgatcatt tagaagagcg cttactcaat ttaacccatc aaaaaggcca gtgcgtgttt 720 
taccctgaac aggacctttt aacgctcgca tgctatcaaa aagttttaat cttgccttat 780 
atttataaca cccacccttt catggccaat caaaaacgct tcatccctga caaaaaagaa 840 
atcgtcatgc tgcattttta ttttgtagga aaaccttggg ttttacctac tttttcatat 900 
tctaaagaat ggcatgagac tcttttaaaa accccttttt atgctgaata ttccgtgaaa 960 
ttccttaaac aaatgacaga atgtttaagc cttaaagaca aacaaaaaac ctttgaattt 1020 
cttgcccccc tactcaataa aaaaaccctt ttagaatacg tcttttttag attgaatagg 1080 
attttcaaac gcttaaaaga aaaatttttt aactcttag 1119 



D. polypeptide sequence deduced from sequence C [SEQ ID NO:4] 
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Lys Leu Phe 


Tyr 


Lys 


He 






35 










40 






45 








His 


Cys 


Leu 


Val 


Asp 


Asn 


Leu 


Ser 


Leu 


Glu 


Asn Gin Ser 


Lys 


Leu 


Lys 




50 










55 








60 








Glu 


Thr 


Leu 


Ala 


Pro 


Phe 


Ser 


Ala 


Phe 


Ser 


Ser Leu Glu 


Phe 


Leu 


Asp 


65 










70 










75 






80 


He 


Ser 


Thr 


Pro 


Asn 


Leu 


His 


Ala 


Thr 


Pro 


He Glu Pro 


Ser 


Ala 


He 










85 










90 






95 




Asp 


Lys 


He 


Asn 


Glu 


Ala 


Phe 


Leu 


Gin 


Leu 


Asn He Tyr 


Ala 


Lys 


Thr 




100 










105 






110 






Arg 


Phe 


Ser 


Lys 


Met 


Val 


Met 


Cys Arg Leu 


Phe Leu Ala 


Ser 


Leu 


Phe 




115 








120 






125 








Pro 


Gin Tyr 


Asp 


Lys 


He 


He 


Met 


Phe Asp 


Ala Asp Thr 


Leu 


Phe 


Leu 




130 










135 








140 








Asn 


Asp Val 


Ser 


Glu 


Ser 


Phe 


Phe 


He 


Pro 


Leu Asp Gly 


Tyr 


Tyr 


Phe 


145 










150 










155 






160 


Gly 


Ala 


Ala 


Lys 


Asp 


Phe 


Ala 


Ser 


Asp 


Lys 


Ser Pro Lys 


His 


Phe 


Gin 






165 










170 






175 




He 


Val 


Arg 


Glu 


Lys 


Asp 


Pro 


Arg 


Gin 


Ala 


Phe Ser Leu 


Tyr 


Glu 


His 








180 










.185 






190 






Tyr 


Leu 


Asn 


Glu 


Ser 


Asp 


Met 


Gin 


He 


lie 


Tyr Glu Ser 


Asn 


Tyr 


Asn 




195 










200 






205 








Ala 


Gly Phe 


Leu 


Val 


Val 


Asn Leu Lys 


Leu 


Trp Arg Ala 


Asp 


His 


Leu 




210 










215 








220 
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Glu 


Glu 


Arg 


Leu 


Leu 


Asn 


Leu 


Thr 


His 


Gin 


Lys 


Gly Gin 


Cys 


val 


Pne 




225 








230 










*> ^ c 








*5 A ft 
Z 4 U 




Tyr 


Pro 


Glu 


Gin 


Asp 


Leu 


Leu 


Thr 


Leu 


Ala 


Cys 


Tyr Gin 


Lys 


val 


Leu 










245 










250 












5 


lie 


Leu 


Pro 


Tyr 
260 


He 


Tyr 


Asn 


Thr 


His 

265 


Pro 


Phe 


Met Ala 


Asn 

OTA 




Lys 




Arg 


Phe 


He 
275 


Pro 


Asp 


Lys 


Lys 


Glu 
280 


He 


Val 


Met 


Leu His 
285 


Phe 


Tyr 


Phe 




Val 


Gly 


Lys 


Pro 


Trp 


Val 


Leu 


Pro 


Thr 


Pne 


Ser 


Tyr Ser 


Lys 




Trp 


10 




290 










295 










300 










His 


Glu 


Thr 


Leu 


Leu 


Lys 


Tnr 


Pro 




Tyr 


Hid 


Glu Tyr 


OCX 


V AX 






305 










310 










J lb 












Phe 


Leu 


Lys 


Gin 


Met 


Thr 


Glu 


Cys 


Leu 


Ser 


Leu 


Lys Asp 


Lys 


Gin 


Lys 










325 










330 








335 




15 


Thr 


Phe 


Glu 


Phe 
340 


Leu 


Ala 


Pro 


Leu 


Leu 
345 


Asn 


Lys 


Lys Thr 


Leu 
350 


Leu 


Glu 




Tyr 


Val 


Phe 
355 


Phe 


Arg 


Leu 


Asn 


Arg 
360 


lie 


Phe 


Lys 


Arg Leu 
365 


Lys 


Glu 


Lys 




Phe 


Phe 


Asn 


Ser 

























20 370 



E. polynucleotide sequence: ORF HP0479 [SEQ ID NO:5] 



25 atgcatgttg cttgtctttt ggctttaggg gataatctca tcacgcttag ccttttaaaa 60 

gaaatcgctt tcaaacagca acaacccctt aaaatcctag gtactcgttt gactttaaaa 120 

atcgccaagc ttttagaatg cgaaaaacat tttgaaatca ttcctctttt tgaaaatgtc 180 

cctgcttttt atgaccttaa aaaacaaggc gtttttttgg cgatgaagga ttttttatgg 240 

ttgttaaaag cgattaaaaa gcatcaaatc aaacgtttga ttttggaaaa acaggatttt 300 

30 agaagcactt ttttagccaa attcattccc ataaccactc caaataaaga aattaaaaac 360 

gtttatcaaa accgccagga gttgttttct caaatttatg ggcatgtttt tgataacccc 420 

ccatatccca tgaatttaaa aaaccccaaa aagattttga tcaacccttt cacaagatcc 480 

atagaccgaa gtatcccttt agagcattta caaatcgttt taaaactttt aaaacccttt 540 

tgtgttacgc ttttagattt tgaagaacga tacgcttttt taaaagacag agtcgctcat 600 

35 tatcgcgcta aaaccagttt agaagaagtt aaaaacctga ttttagaaag cgatttgtat 660 

ataggagggg attcgttttt gatccatttg gcttactatt taaagaaaaa ttattttatc 720 

tttttttata gggataatga tgatttcatg ccgcctaata gtaagaataa aaattttcta 780 

aaagcccaca aaagccattc tatagaacaa gatttagcca aaaaattccg ccatttgggg 84 0 
ctattataa 849 

40 



F. polypeptide sequence deduced from sequence E [SEQ ID NO:6] 



Met 


His 


Val 


Ala 


Cys 


Leu 


Leu 


Ala 


Leu 


Gly 


Asp 


Asn 


Leu 


lie 


Thr 


Leu 


1 








5 










10 










15 




Ser 


Leu 


Leu 


Lys 


Glu 


He 


Ala 


Phe 


Lys 


Gin 


Gin 


Gin 


Pro 


Leu 


Lys 


lie 








20 










25 










30 






Leu 


Gly 


Thr 


Arg 


Leu 


Thr 


Leu 


Lys 


lie 


Ala 


Lys 


Leu 


Leu 


Glu 


Cys 


Glu 






35 










40 










45 








Lys 


His 


Phe 


Glu 


lie 


lie 


Pro 


Leu 


Phe 


Glu 


Asn 


Val 


Pro 


Ala 


Phe 


Tyr 




50 










55 










60 










Asp 


Leu 


Lys 


Lys 


Gin 


Gly 


Val 


Phe 


Leu 


Ala 


Met 


Lys 


Asp 


Phe 


Leu 


Trp 


65 










70 










75 










80 


Leu 


Leu 


Lys 


Ala 


He 


Lys 


Lys 


His 


Gin 


He 


Lys 


Arg 


Leu 


He 


Leu 


Glu 










85 










90 










95 




Lys 


Gin 


Asp 


Phe 


Arg 


Ser 


Thr 


Phe 


Leu 


Ala 


Lys 


Phe 


He 


Pro 


lie 


Thr 








100 










105 










110 






Thr 


Pro 


Asn 


Lys 


Glu 


He 


Lys 


Asn 


Val 


Tyr 


Gin 


Asn 


Arg 


Gin 


Glu 


Leu 
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115 




120 










125 








Phe 


Ser 


Gin He Tyr Gly 


His Val 


Phe 


Asp 


Asn 


Pro 


Pro 


Tyr 


Pro 


Met 




130 






135 








140 










Asn 


Leu 


Lys Asn Pro 


Lys 


Lys lie 


Leu 


He 


Asn 


Pro 


Phe 


Thr 


Arg 


Ser 


145 




150 








155 










160 


He 


Asp 


Arg Ser He 


Pro 


Leu Glu 


His 


Leu 


Gin 


He 


Val 


Leu 


Lys 


Leu 




165 








170 










175 




Leu 


Lys 


Pro Phe Cys 


Val 


Thr Leu 


Leu 


Asp 


Phe 


Glu 


Glu 


Arg 


Tyr 


Ala 




180 






185 










190 






Phe 


Leu 


Lys Asp Arg 


Val 


Ala His 


Tyr 


Arg 


Ala 


Lys 


Thr 


Ser 


Leu 


Glu 






195 




200 










205 








Glu 


Val 


Lys Asn Leu 


He 


Leu Glu 


Ser 


Asp 


Leu 


Tyr 


He 


Gly 


Gly 


Asp 




210 






215 








220 










Ser 


Phe 


Leu He His 


Leu 


Ala Tyr 


Tyr 


Leu 


Lys 


Lys 


Asn 


Tyr 


Phe 


He 


225 






230 








235 










240 


Phe 


Phe 


Tyr Arg Asp 


Asn 


Asp Asp 


Phe 


Met 


Pro 


Pro 


Asn 


Ser 


Lys 


Asn 






245 








250 










255 




Lys 


Asn 


Phe Leu Lys 


Ala 


His Lys 


Ser 


His 


Ser 


He 


Glu 


Gin 


Asp 


Leu 






260 






265 










270 






Ala 


Lys 


Lys Phe Arg His 


Leu Gly 


Leu 


Leu 















275 280 



G. polynucleotide sequence: ORF 1 191 [SEQ ID NO:7J 

atgagcgtaa atgcacccaa acgcatgcgt attttattgc gtttgcctaa ttggttaggc 60 

gatggggtga tggcaagttc gcttttttac acccttaaac accactaccc taacgcgcat 120 

tttatcttag tgggcccaac cattacttgc gaacttttca aaaaagatga aaaaatagaa 180 

gccgttttta tagacaacac caaaaaatcc tttttcaggc tgctagccat tcacaaactc 240 

gctcaaaaaa tagggcgttg cgatatagcg atcactttaa acaaccattt ctattccgct 300 

tttttgctct atgcgacaaa aacgcccgtt cgcatcggtt ttgctcaatt ttttcgttct 360 

ttgtttctca gccatgcgat cgctcctgcc cctaaagagt atcaccaagt ggaaaagtat 420 

tgctttttat tttcgcaatt tttagaaaaa gaattggatc aaaaaagcgt tttaccctta 480 

aaactggcct ttaacctccc cactcacacc ccaaacaccc ctaaaaaaat cggctttaac 54 0 

cctagcgcaa gctatgggag tgctaaaaga tggccagctt cttattacgc tgaagtttct 600 

gctgttttgt tagaaaaagg gcatgaaatt tatttttttg gggctaaaga agacgctatc 660 

gtttctgaag aaattttaaa actcatcaaa ggctcattaa aaaacccctc attgttccat 720 

aacgcttaca atctgtgcgg gaaaacaagc attgaagaat tgatagagcg catcgctgtt 780 

ttagatttat tcatcactaa cgatagcggc cctatgcatg tggctgctag catgcaaacc 84 0 

cccttaatcg ctctttttgg ccccactgat gaaaaagaga ctcgccccta taaagctcaa 900 

aaaacgatcg tattgaacca ccatttaagc tgtgcgcctt gcaagaaacg agtttgccct 960 

ttaaagaatg caaaaaacca tttgtgcatg aaatctatca cgccccttga agtcctagaa 1020 

gccgctcaca ctcttttaga agagccttaa 1050 



H. polypeptide sequence deduced from sequence G [SEQ ID NO:8] 



Met Ser Val Asn Ala Pro 


Lys 


Arg 


Met 


Arg 


He 


Leu 


Leu 


Arg 


Leu 


Pro 


1 5 








10 










15 




Asn Trp Leu Gly Asp Gly 


Val 


Met 


Ala 


Ser 


Ser 


Leu 


Phe 


Tyr 


Thr 


Leu 


20 






25 










30 






Lys His His Tyr Pro Asn 


Ala 


His 


Phe 


He 


Leu 


Val 


Gly Pro 


Thr 


He 


35 




40 










45 








Thr Cys Glu Leu Phe Lys 


Lys 


Asp 


Glu 


Lys 


He 


Glu 


Ala 


Val 


Phe 


He 


50 


55 










60 










Asp Asn Thr Lys Lys Ser 


Phe 


Phe 


Arg 


Leu 


Leu 


Ala 


He 


His 


Lys 


Leu 


65 70 










75 










80 


Ala Gin Lys He Gly Arg 


Cys 


Asp 


He 


Ala 


He 


Thr 


Leu 


Asn 


Asn 


His 


85 








90 










95 
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Phe Tyr Ser Ala Phe Leu Leu Tyr Ala Thr Lys Thr Pro Val Arg lie 

100 105 110 

Gly Phe Ala Gin Phe Phe Arg Ser Leu Phe Leu Ser His Ala lie Ala 
115 120 125 

5 Pro Ala Pro Lys Glu Tyr His Gin Val Glu Lys Tyr Cys Phe Leu Phe 
130 135 140 

Ser Gin Phe Leu Glu Lys Glu Leu Asp Gin Lys Ser Val Leu Pro Leu 
145 150 155 160 

Lys Leu Ala Phe Asn Leu Pro Thr His Thr Pro Asn Thr Pro Lys Lys 
10 165 170 175 

lie Gly Phe Asn Pro Ser Ala Ser Tyr Gly Ser Ala Lys Arg Trp Pro 

180 185 190 

Ala Ser Tyr Tyr Ala Glu Val Ser Ala Val Leu Leu Glu Lys Gly His 
195 200 205 

15 Glu lie Tyr Phe Phe Gly Ala Lys Glu Asp Ala He Val Ser Glu Glu 
210 215 220 

He Leu Lys Leu He Lys Gly Ser Leu Lys Asn Pro Ser Leu Phe His 
225 230 235 240 

Asn Ala Tyr Asn Leu Cys Gly Lys Thr Ser He Glu Glu Leu He Glu 
20 245 250 255 

Arg He Ala Val Leu Asp Leu Phe He Thr Asn Asp Ser Gly Pro Met 

260 265 270 

His Val Ala Ala Ser Met Gin Thr Pro Leu He Ala Leu Phe Gly Pro 
275 280 285 

25 Thr Asp Glu Lys Glu Thr Arg Pro Tyr Lys Ala Gin Lys Thr He Val 
290 295 300 

Leu Asn His His Leu Ser Cys Ala Pro Cys Lys Lys Arg Val Cys Pro 
305 310 315 320 

Leu Lys Asn Ala Lys Asn His Leu Cys Met Lys Ser He Thr Pro Leu 
30 325 330 335 

Glu Val Leu Glu Ala Ala His Thr Leu Leu Glu Glu Pro 
340 345 



35 Sequences from strain SS1 of H. pylori 

I. polynucleotide sequence: ORF SS0826 [SEQ ID NO:9] 



ttgcgtattt ttatcatttc tttaaatcaa aaagtgtgcg ataaatttgg tttggttttt 60 
40 agagacacca cgactttact caatagcatc aatgccaccc accaccaagt gcaaattttt 120 
gatgcgattt attctaaaac ttttgaaggc gggttgcacc ccttagtgaa aaagcattta 1B0 
cacccttatt tcatcacgca aaacatcaaa gacatgggaa ttacaaccag tctcatcagt 240 
gaggtttcta agttttatta cgctttaaaa taccatgcga agtttatgag cttgggagag 3 00 
cttgggtgct atgcgagcca ttattccttg tgggaaaaat gcatagaact caatgaagcg 3 60 
45 atctgtattt tagaagacga tataaccttg aaagaggatt ttaaagaggg cttggatttt 420 
ttagaaaaac acatccaaga gttaggctat gttcgcttga tgcatttatt atatgatccc 4 80 
aatattaaaa gtgagccatt gaaccataaa aaccacgaga tacaagagcg tgtagggatt 540 
attaaagctt atagcgaagg ggtggggact caaggctatg tgatcacgcc caagattgcc 600 
aaagttttta aaaaacacag ccgaaaatgg gttgttcctg tggatacgat aatggacgct 660 
50 acttttatcc atggcgtgaa aaatctggtg ttacaacctt ttgtgatcgc tgatgatgag 720 
caaatctcta cgatagcgcg aaaagaacaa ccttatagcc ctaaaatcgc cttaatgaga 780 
gaactccatt ttaaatattt gaaatattgg cagtttatat ag 822 



55 J. polypeptide sequence deduced from sequence I [SEQ ID NO:10] 



Leu Arg He Phe He He Ser Leu Asn Gin Lys Val Cys Asp Lys Phe 
1 5 10 15 
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Gly Leu Val Phe Arg Asp Thr Thr Thr Leu Leu Asn Ser He Asn Ala 

20 25 30 

Thr His His Gin Val Gin He Phe Asp Ala He Tyr Ser Lys Thr Phe 
35 40 45 

5 Glu Gly Gly Leu His Pro Leu Val Lys Lys His Leu His Pro Tyr Phe 
50 55 60 

He Thr Gin Asn He Lys Asp Met Gly He Thr Thr Ser Leu He Ser 
65 70 75 80 

Glu Val Ser Lys Phe Tyr Tyr Ala Leu Lys Tyr His Ala Lys Phe Met 
10 85 90 95 

Ser Leu Gly Glu Leu Gly Cys Tyr Ala Ser His Tyr Ser Leu Trp Glu 

100 105 110 

Lys Cys He Glu Leu Asn Glu Ala He Cys He Leu Glu Asp Asp He 
115 120 125 

15 Thr Leu Lys Glu Asp Phe Lys Glu Gly Leu Asp Phe Leu Glu Lys His 
130 135 140 

He Gin Glu Leu Gly Tyr Val Arg Leu Met His Leu Leu Tyr Asp Pro 
145 150 155 . 160 

Asn He Lys Ser Glu Pro Leu Asn His Lys Asn His Glu He Gin Glu 
20 165 170 175 

Arg Val Gly He He Lys Ala Tyr Ser Glu Gly Val Gly Thr Gin Gly 

•180 185 190 

Tyr Val He Thr Pro Lys He Ala Lys Val Phe Lys Lys His Ser Arg 
195 200 205 

25 Lys Trp Val Val Pro Val Asp Thr He Met Asp Ala Thr Phe He His 
210 215 220 

Gly Val Lys Asn Leu Val Leu Gin Pro Phe Val lie Ala Asp Asp Glu 
225 230 235 240 

Gin He Ser Thr He Ala Arg Lys Glu Gin Pro Tyr Ser Pro Lys He 
30 245 250 255 

Ala Leu Met Arg Glu Leu His Phe Lys Tyr Leu Lys Tyr Trp Gin Phe 

260 265 270 

He 



35 



K. polynucleotide sequence: ORF SS0159 [SEQ ID NO:1 1] 



atgagtatta ctattcctat tgttatcgct tttgacaatc attacgccat tccggctggc 60 

gtgagcctgt attccatgct agcttgcact aaaacagaac acccccaatc acaaaatgat 120 

40 agtgaaaaac ttttttataa aatccactgc ctggtagata acttaagcct tgaaaaccag 180 

tgcaaattga aagaaactct agcccccttt agcgctttta tgagcgtgga ttttttagac 240 

atttcaaccc ctaatcttta caccccttca atagaaccct ctgcgattga taaaatcaat 300 

gaagcttttt tgcaactcaa tatttacgct aagactcgct tttctaaaat ggtcatgtgc 360 

cgcttgtttt tggcttcttt attcccgcaa tacgacaaaa tcatcatgtt tgatgcggac 420 

45 actttgtttt taaacgatgt gagcgagagt ttttttatcc cgctagatgg ttattatttt 480 

ggagcggcta aagatttttc ttctcctaaa aaccttaaac attttcaaac agaaagggag 540 

agagagcctc gccaaaaatt ttttctccat gagcattacc ttaaagaaaa agacatgaaa 600 

atcatttgtg aaaaccacta taatgttggg tttttaatcg tgaatttaaa gctgtggcgt 660 

. gctgatcatt tagaagaacg cttactcaat ttaacccatc aaaaaggcca gtgtgtgttt 720 

50 tgccctgaac aggatatttt aacgctcgca tgctatcaaa aagttttaca attaccttat 780 

atttacaaca cccacccttt catggtcaat caaaaacgct tcatccctaa caaaaaagaa 840 

atcgtcatgc tgcattttta ttttgtagga aaaccttggg ttttacccac tgctttatat 900 

tctaaagaat ggcatgagac tcttttaaaa accccttttt acgctgaata ttccgtgaaa 960 

tttcttaaac aaatgacaga atttttaagc cttaaagaca aacaaaaaac ctttgaattt 1020 

55 cttgcccccc tactcaataa aaaaaccctt ttagaatatg tcttttttag attgaatagg 10 80 

attttcaaac gcttaaaaga aaaactttta aactcttagc 1120 
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L. polypeptide sequence deduced from sequence K [SEQ ID NO:12] 





Met 


Ser 


He 


Thr 


He 


Pro 


lie 


Val 


He 


Ala 


Phe 


Asp 


Asn 


His 


Tyr Ala 




1 








5 










10 










15 




5 


lie 


Pro 


Ala Gly 


Val 


Ser 


Leu 


Tyr 


Ser 


Met 


Leu 


Ala 


Cys 


Thr 


Lys 


Thr 










20 




















J U 








Glu 


HIS 


Pro 


Gin 


ser 


bin 


Asn 


Asp 


Ser 


olU 


Lys 


T on 
Leu 


r*ne 


Tyr 


Lys lie 








35 










4 0 




















His 


Cys 


Leu 


Val 


Asp 


Asn 


Leu 


oer 


Leu 


OlU 


Asn 


uill 


Cys 


Lys 


Leu 


Lys 


10 




50 










55 










60 












Glu 


Thr 


Leu 


Ala 


Pro 


pne 


ser 


Aia 


Fne 


riet. 


Car 


vai 


Asp 




Leu 


Asp 




65 










70 










75 










80 




He 


Ser 


Thr 


Pro 


Asn 
85 


Leu 


Tyr 


Thr 


Pro 


Ser 
90 


lie 


Glu 


Pro 


Ser 


Ala 
95 


lie 


15 


Asp 


Lys 


He 


Asn 
100 


Glu 


Ala 


Phe 


Leu 


Gin 
105 


Leu 


Asn 


lie 


Tyr 


Ala 
110 


Lys 


Thr 




Arg 


Phe 


Ser Lys 


Met 


Val 


Met 


Cys 


Arg 


Leu 


pne 


Leu 


Ala 


Cay 

ser 


Leu 


Phe 








115 










120 










125 










Pro 


Gin 


Tyr Asp 


Lys 


lie 


lie 


Met 


Phe 


Asp 


Ala 


Asp 


Thr 


Leu 


Phe 


Leu 


20 




130 










135 










140 












Asn 


Asp 


Val Ser 


Glu 


Ser 


Phe 


Til. ^ 

Phe 


He 


Pro 


Leu 


Asp 


Gly 


Tyr 


Tyr 


Phe 




145 










150 










155 










160 




Gly 


Ala 


Ala 


Lys 


Asp 
165 


Phe 


Ser 


Ser 


Pro 


Lys 
170 


Asn 


Leu 


Lys 


His 


Phe 
175 


Gin 


25 


Thr 


Glu 


Arg 


Glu 
180 


Arg 


Glu 


Pro 


Arg 


Gin 
185 


Lys 


Phe 


Phe 


Leu 


His 
190 


Glu 


His 




Tyr 


Leu 


Lys 


Glu 


Lys 


Asp 


Met 


Lys 


lie 


lie 


Cys 


Glu 


Asn 


His 


Tyr Asn 








195 










200 










205 










Val 


Gly 


Phe 


Leu 


He 


Val 


Asn 


Leu 


Lys 


Leu 


Trp 


Arg 


Ala 


Asp 


His 


Leu 


30 




210 










215 










220 












Glu 


Glu 


Arg 


Leu 


Leu 


Asn 


Leu 


Thr 


His 


Gin 


Lys 


Gly 


Gin 


Cys 


Val 


Phe 




225 










230 










235 










240 




Cys 


Pro 


Glu 


Gin 


Asp 
245 


lie 


Leu 


Thr 


Leu 


Ala 
250 


Cys 


Tyr 


Gin 


Lys 


Val 
255 


Leu 


35 


Gin 


Leu 


Pro 


Tyr 
260 


He 


Tyr 


Asn 


Thr 


His 
265 


Pro 


Phe 


Met 


Val 


Asn 
270 


Gin 


Lys 




Arg 


Pfle 


He 
275 


Pro 


Asn 


Lys 


Lys 


280 


lie 


vai 


fie c 


Leu 


His 
285 


pne 


Tyr 


Phe 




Val 


Gly 


Lys 


Pro 


Trp 


Val 


Leu 


Pro 


Thr 


Ala 


Leu 


Tyr 


Ser 


Lys 


Glu Trp 


40 




290 










295 










300 












His 


Glu 


Thr 


Leu 


Leu 


Lys 


Thr 


Pro 


Phe 


Tyr 


Ala 


Glu 


Tyr 


Ser 


Val 


Lys 




305 










310 










315 










320 




Phe 


Leu 


Lys 


Gin 


Met 
325 


Thr 


Glu 


Phe 


Leu 


Ser 
330 


Leu 


Lys 


Asp 


Lys 


Gin 
335 


Lys 


45 


Thr 


Phe 


Glu 


Phe 
340 


Leu 


Ala 


Pro 


Leu 


Leu 
345 


Asn 


Lys 


Lys 


Thr 


Leu 
350 


Leu 


Glu 




Tyr 


Val 


Phe 


Phe 


Arg 


Leu 


Asn 


Arg 


He 


Phe 


Lys 


Arg 


Leu 


Lys 


Glu 


Lys 



355 360 365 



Leu Leu Asn Ser 
50 370 



M. polynucleotide sequence: ORF SS0479 [SEQ ID NO: 13] 

55 atgcatgttg cttgtctttt ggctttaggg gataacctca tcacgcttag cctttgtgaa 60 
gaaatcgctc tcaaacagca acaacccctt aaaatcctag gtactcgttt gactttaaaa 120 
atcgccaagc ttttagaatg cgaaaaacat tttgaaatca ttcctgtttt taaaaatatc 180 
cccgcttttt atgaccttaa aaaacaaggc gttttttggg cgatgaagga ttttttatgg 240 
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ttattaaaag cgcttaagaa gcacaaaatc 
agaagcgctc ttttatccaa atttgtttcc 
gcttatcaaa accgccagga gttgttttct 
caatattcca tgagtttaaa aaaccccaaa 

5 aataatagaa atatttcttt agaacatttg 
tgtgttacgc ttttagattt tgaagaacga 
tatcgcgcta aaaccagttt agaagaagct 
ataggggggg attcgttttt gatccatttg 
tttttttata gggataatga cgatttcatg 

10 cataaaagcc atttcataga gcaggattta 
taa 



aaacacttga ttttagaaaa acaagatttt 3 00 

ataaccactc caaataaaga aattaaaaat 360 

caaatttatg ggcatgtttt tgataatagt 420 

aagattttaa tcaacccttt cacgagagaa 480 

caaatcgttt taaaactgtt aaaacccttt 540 

tacgcttttt taaaagatga agtcgctcat. 600 

aaaaacctga ttttagaaag cgatttgtat 660 

gcttactatt taaagaaaaa ttattttatc 720 

ccgcctaaga atgaaaattt tctaaaagcc 780 

gccacccagt tccgccattt ggggctatta 840 

843 



N. polypeptide sequence deduced from sequence M [SEQ ID NO: 14] 

15 





Met 


His 


Val 


Ala 


Cys 


Leu 


Leu 


Ala 


Leu 


Gly 


Asp 


Asn 


Leu 


He 


Thr 


Leu 




1 








5 










10 










15 






Ser 


Leu 


Cys 


Glu 


Glu 


He 


Ala 


Leu 


Lys 


Gin 


Gin 


Gin 


Pro 


Leu 


Lys 


He 








20 










25 










30 






20 


Leu 


Gly 


Thr Arg 


Leu 


Thr 


Leu 


Lys 


He 


Ala 


Lys 


Leu 


Leu 


Glu 


Cys 


Glu 








35 










40 










45 










Lys 


His 
50 


Phe 


Glu 


He 


He 


Pro 
55 


Val 


Phe 


Lys 


Asn 


He 
60 


Pro 


Ala 


Phe 


Tyr 




Asp 


Leu 


Lys 


Lys 


Gin 


Gly 


Val 


Phe 


Trp 


Ala 


Met 


Lys 


Asp 


Phe 


Leu 


Trp 


25 


65 










70 










75 










80 




Leu 


Leu 


Lys 


Ala 


Leu 
85 


Lys 


Lys 


His 


Lys 


He 
90 


Lys 


His 


Leu 


He 


Leu 

•95 


Glu 




Lys 


Gin 


Asp 


Phe 
100 


Arg 


Ser 


Ala 


Leu 


Leu 
105 


Ser 


Lys 


Phe 


Val 


Ser 
110 


He 


Thr 


30 


Thr 


Pro 


Asn 
115 


Lys 


Glu 


He 


Lys 


Asn 
120 


Ala 


Tyr 


Gin 


Asn 


Arg 
125 


Gin 


Glu 


Leu 




Phe 


Ser 
130 


Gin 


He 


Tyr 


Gly 


His 
135 


Val 


Phe 


Asp 


Asn 


Ser 
140 


Gin 


Tyr 


Ser 


Met 




Ser 


Leu 


Lys 


Asn 


Pro 


Lys 


Lys 


He 


Leu 


He 


Asn 


Pro 


Phe 


Thr 


Arg 


Glu 


35 


145 










150 










155 










160 




Asn 


Asn 


Arg Asn 


He 


Ser 


Leu 


Glu 


His 


Leu 


Gin 


He 


Val 


Leu 


Lys 


Leu 












165 










170 










175 






Leu 


Lys 


Pro 


Phe 
180 


Cys 


Val 


Thr 


Leu 


Leu 
185 


Asp 


Phe 


Glu 


Glu 


Arg 
190 


Tyr 


Ala 


40 


Phe 


Leu 


Lys 
195 


Asp 


Glu 


Val 


Ala 


His 
200 


Tyr 


Arg 


Ala 


Lys 


Thr 
205 


Ser 


Leu 


Glu 




Glu 


Ala 
210 


Lys 


Asn 


Leu 


He 


Leu 
215 


Glu 


Ser 


Asp 


Leu 


Tyr 
220 


He 


Gly 


Gly 


Asp 




Ser 


Phe 


Leu 


He 


His 


Leu 


Ala 


Tyr 


Tyr 


Leu 


Lys 


Lys 


Asn 


Tyr 


Phe 


He 


45 


225 










230 










235 










240 




Phe 


Phe 


Tyr Arg 


Asp 


Asn 


Asp 


Asp 


Phe 


Met 


Pro 


Pro 


Lys 


Asn 


Glu 


Asn 












245 










250 










255 






Phe 


Leu 


Lys 


Ala 
260 


His 


Lys 


Ser 


His 


Phe 
265 


He 


Glu 


Gin 


Asp 


Leu 
270 


Ala 


Thr 


50 


Gin 


Phe 


Arg His 


Leu 


Gly 


Leu 


Leu 



















275 280 



Sequences from str ain PJ1 of H. pylori 

55 

O. polynucleotide sequence: ORF PJ1-0479 [SEQ ID NO:15] 

atgcatgttg cttgtctttt ggctttaggg gataacctca tcacgcttag ccttttaaaa 60 
gaaatcgctt ccaaacagca acggcccctt aaaatcctag gcactcgttt gactttaaaa 120 
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atcgccaagc ttttagaatg cgaaaaacat tttgaaatca ttcctatttt tgaaaatatc 180 

cctgcttttt atgatcttaa aaaacaaggc gttttttggg cgatgaagga ttttttatgg 24 0 

ttgttaaaag caattaagaa gcacaaaatc aaacatttga ttttagaaaa acaagatttt 300 

agaagttttc ttttatccaa atttgtttcc ataaccactc ccaataaaga aattaaaaac 360 

5 gtttatcaaa accgccagga gttgttttct ccaatttatg ggcatgtttt tgataacccc 420 

ccatatccca tgaatttaaa aaaccccaaa aagattttga tcaacccttt cacaagatcc 480 

atagagcgaa gtatcccttt agagcattta aaaatcgttt taaaactctt aaaacccttt 540 

tgtgttacgc ttttagattt tgaagaacga tacgcttttt tacaaaatga agccactcat 600 

tatcgtgcta aaaccagttt agaagaagtt aaaagcctga ttttagaaag cgatttgtat 660 

10 ataggggggg attcgttttt aatccatttg gcttactatt taaagaaaaa ttattttatc 720 

tttttttata gggataatga cgatttcatg ccacctaatg gtaagaagga aaattttcta 780 

aaagcccaca aaagccatta catagaacag gatttagcca aaaaattccg ccatttgggg 840 
cttattataa 850 

15 

P. polypeptide sequence deduced from sequence O [SEQ ID NO:1 6] 





Met 


His 


Val 


Ala 


Cys 


Leu 


Leu 


Ala 


Leu 


Gly 


Asp 


Asn 


Leu 


He 


Thr 


Leu 


20 


1 








5 










10 










15 






Ser 


Leu 


Leu . Lys 


Glu 


He 


Ala 


Ser 


Lys 


Gin 


Gin 


Arg 


Pro 


Leu 


Lys 


He 










20 










25 










30 








Leu 


Glv 


Thr 
35 




Leu 


Thr 


Leu 


Lys 
i 

40 


He 


Ala 


Lys 


Leu 


Leu 
45 


Glu 


Cys 


Glu 


25 


Lys 


His 
50 


Phe 


Glu 


He 


He 


Pro 
55 


He 


Phe 


Glu 


Asn 


He 
60 


Pro 


Ala 


Phe 


Tyr 




Asp 


Leu 


Lys 


Lys 


Gin 


Gly 


Val 


Phe 


Trp 


Ala 


Met 


Lys 


Asp 


Phe 


Leu 


Trp 




65 










70 










75 










80 




Leu 


Leu 


Lys 


Ala 


He 


Lys 


Lys 


His 


Lys 


He 


Lys 


His 


Leu 


He 


Leu 


Glu 


30 










85 










90 










95 






Lys 


Gin 


Asp 


Phe 
100 


Arg 


Ser 


Phe 


Leu 


Leu 
105 


Ser 


Lys 


Phe 


Val 


Ser 
110 


He 


Thr 




Thr 


Pro 


Asn 
115 


Lys 


Glu 


He 


Lys 


Asn 
120 


Val 


Tyr 


Gin 


Asn 


Arg 
125 


Gin 


Glu 


Leu 


35 


Phe 


Ser 
130 


Pro 


He 


Tyr 


Gly 


His 
135 


Val 


Phe 


Asp 


Asn 


Pro 
140 


Pro 


Tyr 


Pro 


Met 




Asn 


Leu 


Lys 


Asn 


Pro 


Lys 


Lys 


He 


Leu 


He 


Asn 


Pro 


Phe 


Thr 


Arg 


Ser 




145 










150 










155 










160 




He 


Glu 


Arg 


Ser 


He 


Pro 


Leu 


Glu 


His 


Leu 


Lys 


He 


Val 


Leu 


Lys 


Leu 


40 










165 










170 










175 






Leu 


Lys 


Pro 


Phe 
180 


Cys 


Val 


Thr 


Leu 


Leu 
185 


Asp 


Phe 


Glu 


Glu 


Arg 
190 


Tyr 


Ala 




Phe 


Leu 


Gin 
195 


Asn 


Glu 


Ala 


Thr 


His 
200 


Tyr 


Arg 


Ala 


Lys 


Thr 
205 


Ser 


Leu 


Glu 


45 


Glu 


Val 
210 


Lys 


Ser 


Leu 


He 


Leu 
215 


Glu 


Ser 


Asp 


Leu 


Tyr 
220 


He 


Gly 


Gly 


Asp 




Ser 


Phe 


Leu 


He 


His 


Leu 


Ala 


Tyr 


Tyr 


Leu 


Lys 


Lys 


Asn 


Tyr 


Phe 


He 




225 










230 










235 










240 




Phe 


Phe 


Tyr 


Arg 


Asp 


Asn 


Asp 


Asp 


Phe 


Met 


Pro 


Pro 


Asn 


Gly 


Lys 


Lys 


50 










245 










250 










255 






Glu 


Asn 


Phe 


Leu 
260 


Lys 


Ala 


His 


Lys 


Ser 
265 


His 


Tyr 


He 


Glu 


Gin 
270 


Asp 


Leu 




Ala 


Lys 


Lys 


Phe 


Arg 


His 


Leu 


Gly 


Leu 


He 


He 













275 280 



55 



17 



WO 01/00796 



PCT/CA00/00777 



Preferred embodiments of the invention are polynucleotides coding for H. pylori 
glycosyltransferases involved in the biosynthesis of the core or O-chain regions 
of the bacterial lipopolysaccharide (LPS), in particular polynucleotides having 
sequences shown in Table 1 (SEQ ID NO: 1, 3, 5, 7, 9, 11 f 13 and 15) ? 

5 polynucleotides closely related thereto, as well as fragments and variants thereof. 
Another preferred embodiments of the invention are polynucleotides that are at 
least 70% identical over their entire length to polynucleotides shown in Table 1 , 
preferably at least 80% identical, more preferably at least 90% identical, most 
preferably at least 95% identical, and polynucleotides that are complementary to 

10 such polynucleotides. Furthermore, those with at least 97% are highly preferred 
among those with at least 95%, and among these those with at least 98% and at 
least 99% are particularly highly preferred, with at least 99% being the most 
preferred. 

is Of the polynucleotides showing substantial identity to the polynucleotides shown 
in Table 1 (SEQ ID NO: 1, 3, 5, 7, 9, 11, 13 and 15), preferred are those which 
encode polypeptides showing substantially the same biological function or activity 
as the polypeptides shown in Table 1 (SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14 and 
16). 

20 

Polynucleotides shown in Table 1 correspond to open reading frames HP0826 
(SEQ ID NO: 1), HP0159 (SEQ ID NO: 3), HP0479 (SEQ ID NO: 5) and HP1191 
(SEQ ID NO:7) of the genomic DNA of H. pylori strain 26695, to open reading 
frames SS0826 (SEQ ID NO: 9), SS0159 (SEQ ID NO: 11) and SS0479 (SEQ ID 

25 NO: 13) of the genomic DNA of H. pylori strain SS1, and to open reading frame 
PJ1-0479 (SEQ ID NO:15) of the genomic DNA of H. pylori strain PJ1. Among 
several others, ORFs HP0826, HP0159, HP0479 and HP1191 have been 
identified using the complete annotated genome sequence of H. pylori strain 
26695 and BLAST analysis as potentially coding for glycosyltransferases. They 

30 have been proven, directly or indirectly, to encode a p-1,4-galactosyltransferase 
(HP0826), a a-1 ,6-glucosyltransferase (HP0159), a heptosyltransferase 
(HP0479), and an ADP-heptose-LPS heptosyltransferase II (HP1191), which are 
enzymes involved in the biosynthesis of the H. pylori lipopolysaccharide. ORFs 
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identified by BLAST analysis have been cloned, expressed, and isolated using 
techniques well known to those skilled in the art, also discussed more in detail 
further in this disclosure. 

5 The isolated polynucleotides of the present invention can be used in the 
production of polypeptides they encode. For example, a polynucleotide 
containing all or part of the coding sequence for a Helicobacter 
glycosyltransferase can be incorporated into various DNA constructs, such as 
expression cassettes, and vectors, such as recombinant plasmids, adapted for 

10 further manipulation of polypeptide sequences or for the production of the 
encoded polypeptide in suitable host cells, either eukaryotic, such as yeast or 
plant cells, or prokaryotic, such as bacteria, for example E. coli This can be 
achieved using recombinant DNA techniques and methodologies well known to 
those skilled in the art. 

15 

Thus the present invention further provides recombinant nucleic acids comprising 
polynucleotide sequences which encode glycosyltransferases involved in the 
biosynthesis of lipopolysaccharides of bacteria of the genus Helicobacter, more 
particularly of lipopolysaccharides of the species Helicobacter pylori and various 

20 strains thereof. Most particularly, the invention provides recombinant nucleic 
acids comprising polynucleotides identical over their entire lengths to 
polynucleotides having sequences set out in Table 1 , as well as fragments and 
variants of such sequences. Among fragments and variants, preferred are those 
coding for polypeptides retaining the biological function or activity of the reference 

25 polypeptides. 

The isolated polynucleotides and fragments thereof can also be used as DNA 
diagnostic probes specific to H. pylori, for diagnostic or similar purposes. They 
may be used, for example, to check whether or not the polynucleotides according 
30 to the present invention are transcribed in bacteria of an infected tissue. They 
may be also useful in diagnosis of the stage of infection and determining the 
specific pathogen involved. 
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The isolated polynucleotides of the present invention may further be used as 
hybridization probes for RNA. cDNA and genomic DNA, for example to isolate 
cDNA or genomic clones of other genes that have a high sequence similarity to 
the polynucleotides of the present invention. Such probes will comprise at least 
5 15 bases, preferably at least 30 bases, but may have even more than 50 bases. 

Preferred isolated or recombinant polypeptides of the present invention are those 
showing the activity of glycosyltransferases involved in biosynthesis of the 
bacterial lipopolysaccharides of bacteria of the genus Helicobacter, more 

10 particularly lipopolysaccharides of the species Helicobacter pylori and various 
strains thereof. Most particularly preferred are polypeptides coded by 
polynucleotides having sequences shown in Table 1 (SEQ ID NOs: 1, 3, 5, 7, 9, 
11 f 13 and 15), and also those which have at least 50% identity to polypeptides 
shown in Table 1 (SEQ ID NOs: 2, 4, 6, 8, 10, 12, 14 and 16), preferably at least 

is 70% identity, more preferably at least 80% identity, most preferably at least 95% 
identity, polypeptides closely related thereto as well as fragments and variants 
thereof. Of the polypeptides having substantial identity to polypeptides of Table 1, 
preferred are those having the same biological function or activity as the 
polypeptides appearing in Table 1 . 

20 

Polypeptides having amino acid sequences shown in Table 1 correspond to 
those coded by open reading frames HP0826 (SEQ ID NO: 2), HP0159 (SEQ ID 
NO: 4), HP0479 (SEQ ID NO: 6) and HP1191 (SEQ ID NO:8) of the genomic 
DNA of H. pylori strain 26695, by open reading frames SS0826 (SEQ ID NO: 10), 

25 SS0159 (SEQ ID NO: 12) and SS0479 (SEQ ID NO: 14) of the genomic DNA of 
H. pylori strain SS1, and by open reading frame PJ0479 of the genomic DNA of 
H. pylori strain PJ1 . Among several others, these ORFs have been cloned and 
expressed in suitable host cells and their function has been determined in vitro 
using techniques well known to those skilled in the art and discussed more in 

30 detail further in this disclosure. 



Polypeptides of the present invention can be produced as discussed above in 
connection with recombinant nucleic acids of the present invention. They can be 
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recovered and purified from recombinant cell cultures by methods and techniques 
well known to those skilled in the art, including ammonium sulfate or thanoi 
precipitation, acid extraction, and various forms of chromatography, in particular 
ion exchange and high performance liquid chromatography (HPLC). Well known 
5 techniques for refolding protein may be employed to regenerate active 
conformation when the polypeptide is denaturated during isolation and/or 
purification. 

The invention also relates to methods of screening compounds, to identify those 
10 which enhance (agonists) or block (antagonists) the action of polynucleotides or 
polypeptides of the present invention. Of those, antagonists acting as 
bacteriostatic or bactericidal agents are of particular interest. Potential 
antagonists include small organic molecules, peptides, polypeptides and 
antibodies that bind to a polynucleotide or polypeptide of the present invention 
15 and therefore inhibit its activity. Polynucleotides and polypeptides of the present 
invention may be used to assess the binding of small molecule substrates and 
ligands from various sources, including cells, cell-free preparations, chemical 
libraries, and natural product mixtures. The substrates and ligands may be 
natural substrates and ligands or may be structural or functional mimetics. 

20 

Polypeptides of the present invention are particularly useful for screening 
chemical compounds modulating the enzymatic activity of glycosyltransferases of 
Helicobacter origin involved in the biosynthesis of bacterial lipopolysaccharides, 
to identify those which enhance (agonists) or inhibit (antagonists or inhibitors) the 

25 action of Helicobacter glycosyltransferases, in particular compounds that are 
bacteriostatic and/or bactericidal. The method of screening may involve high- 
throughput techniques and assays. In a typical assay, a synthetic reaction mix 
comprising a polypeptide of the present invention and a labelled substrate or 
ligand of such polypeptide is incubated in the absence and in the presence of a 

30 candidate substance, a potential agonist or antagonist of the enzyme under 
study. This capability is reflected in decreased binding of the labeled ligand or in 
decreased production of a product from the labeled substrate. Detection of the 
rate or level of production of the product from the substrate may be enhanced by 
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using a suitable reporter system, such as a colorimetrically labelled substrate 
which is converted into a colorimetrically assayable product or a reporter gene 
responsiv to changes in the enzymatic activity of the polypeptide. 

5 The polypeptides of the present invention showing enzymatic activity of 
Helicobacter glycosyltransferases are also useful for the enzymatic synthesis of 
bacterial lipopolysaccharides and fragments thereof. When included in suitable 
reaction mixtures, these polypeptides catalyze the transfer of mono- or 
oligosaccharide residues to a suitable acceptor. In a preferred embodiment, the 

10 polypeptides of the present invention are used for the preparation of various 
mimics, analogues and derivatives of Helicobacter lipopolysaccharides. 

In yet another aspect, the invention provides novel mutants of Helicobacter 
bacteria, in particular mutants of H. pylori, having mutated (deactivated) genes of 

15 glycosyltransferases involved in the biosynthesis of bacterial lipopolysaccharides, 
in particular of the core or O-chain regions of LPS. Structural analysis of LPS 
isolated from the mutants confirmed that O-chain synthesis has been affected by 
the mutations and revealed the exact structure of the truncated LPS molecules. 
The mutant strains were also shown to have a reduced capacity of gastric 

20 colonization. 

The mutant bacteria expressing the truncated LPS and the LPS isolated from 
such mutants are useful as sources of antigens to be used in vaccination against 
Helicobacter bacteria, in particular against H. pylori. Such vaccines are normally 

25 prepared from dead bacterial cells, using methods well known to those skilled in 
the art, and usually contain various auxiliary components, such as an appropriate 
adjuvant and a delivery system. A delivery system aiming at mucosal delivery is 
preferred. Preferably but not essentially, the antigenic preparation is administered 
orally to the host, but parenteral admistration is also possible. Live vaccines 

30 based on H. pylori mutants may also be prepared, but would normally require an 
appropriate vector for mucosal delivery. Vaccines of the present invention are 
useful in preventing and reducing the number of H. pylori infections and indirectly 
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in reducing the incidence of pathological conditions associated with such 
infections, in particular gastric cancer. 

Chemically modified LPS isolated from mutants expressing the truncated LPS is 
5 a preferred antigen for use in vaccines according to the present invention. It is 
isolated from the bacteria and at least partially purified using techniques well 
known to those skilled in the art. Preparations of at least 70%, particularly 80%, 
more particularly 90%, most particularly 95% pure LPS are preferred. The purity 
of an LPS preparation is expressed as the weight percentage of the total 
10 Helicobacter antigens present in the preparation. The purified LPS can be used 
as antigen either directly or after being conjugated to a suitable carrier protein. 

In the following, the invention will be described in still greater detail, by way of 
examples and with respect to the preferred embodiments. 

15 

Identification and cloning of p-1,4-galactosy transferase 

A search of the H. pylori genomic database of translated proteins revealed three 
open reading frames (ORFs) (HP0826, HP0805 and HP0619) which exhibited 

20 limited homology with the lex2B gene from Haemophilus influenzae (39% 
identity) and the lob1 gene from Haemophilus somnus (32% identity). While both 
the lex2B and lob1 genes of Haemophilus have been shown to be involved in 
synthesis of the outer core region of the lipooligosaccharide (Jarosik ef al t Infect. 
Immun. 62: 4861-4867 (1994); Inzana ef a\, Infect Immun. 65: 4675-4681 

25 (1997)), to date no definitive function for either gene has been proposed. There 
is evidence that they are involved in addition of glucose (lex2B) and galactose 
{lob1) to the core heptose region. Both lex2B and lob1 show significant 
homology to a larger group of LOS biosynthesis proteins which include the H. 
influenzae lex1/lic2A genes (Cope et al, MoL Microbiol, 5: 1113-1124 (1994)) 

30 and lic2B gene (High et al, MoL MicrobioL 9: 1275 (1993)), Neisseria IgtB and 
IgtE genes (Wakarchuk ef a/, J. Bio. Chem. 271: 19166-19173 (1996)) and IpsA 
of P. haemolytica (Potter ef al, FEMS MicrobioL Lett. 129: 75-81 (1995) which are 
all involved in outer core assembly. The LgtB and LgtE proteins of N. meningitidis 

23 



WO 01/00796 



PCT/CA00/00T77 



have been shown to be galactosyltransferases involved in the transfer of 
galactose in a p-1,4 linkage in the terminal lacto-N-neotetraose structure. LgtB is 
responsible for the addition of Gal to GlcNAc, an identical function to that 
described here for HP0826, while LgtE catalyses the addition of Gal to Glc 
5 (Wakarchuk et a/, supra). Clustal multiple sequence alignment of HP0826 amino 
acid (aa) sequence and lex2B, lob1 and IgtB aa sequences from this group of 
related LOS biosynthesis proteins did identify two regions of conservation 
spanning the regions in HP0826 from approx. aa90 to aa142 and aa189 to aa235 
(see Fig 1). Limited homology was also observed with waaX from E. coli 
10 (Heinrichs et al, MoL Microbiol. 30: 221-232 (1998)), a putative core p-1,4- 
galactosyltransferase, only in the region spanning aa96-aa142 (data not shown). 
No significant homology was obtained with any putative glycosyltransferases 
involved in O-chain assembly from Gram-negative bacteria. 

15 Synthetic oligonucleotide primers which contained BamHI restriction sites which 
flanked the 5' and 3' ends of HP0826, HP0619, and HP0805 respectively,- were 
used in a PGR reactions containing chromosomal DNA of H. pylori 26695 or SS1 
as a template. A single PCR product was obtained in each case and this was 
cloned into pUC19 to give plasmids pHP0826, pHP0805, and pHP0619. DNA 

20 sequencing was used to confirm the identity of the cloned PCR products from 
26695 and SS1. 

Three additional open reading frames of H. pylon genome, HP0159, HP1191 and 
HP0479, have been identified by BLAST analysis as potentially coding for LPS 
25 glycosyltransferases. Of those, HP0159 displayed homology to the rfaJ, 
lipopolysaccharide 1 ,2-glucosyltransferase gene from a number of bacterial 
species, HP0479 and HP1191 displayed homology to waaC and waaF 
respectively, which are heptosyltransferase genes responsible for the addition of 
LD heptose to KDO in the core backbone. 

30 

Functional analysis of lex2B homologues 

P-1 ,4-galactosyltransferase activity has previously been detected in H. pylori 
(Chan et al, Glycobiology 5: 683-688 (1995)), but the gene(s) for this enzyme 
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have not been described. Enzyme activity was detected in extracts of E. coli 
pHP0826 but not from clones of HP0805 and HP0619 using the synthetic 
acceptor molecule FCHASE aminophenylp-GlcNAc and UDP-Gal as the donor. 
The lack of detectable activity in HP0805 and HP0619 clones could be a lack of 
the appropriate donor/acceptor molecule for their respective enzymatic activities. 
p-1 ,4-galactosyltransferase activity was also present in parent H. pylori strains 
but not in the H. pylori HP0826 mutants. The assays were followed by TLC 
analysis of the reaction mixtures as previously described (Gilbert et al, Eur. X 
Biochem. 249: 187-194 (1997)). A more sensitive capillary electrophoresis (CE) 
analysis of the reaction mixtures clearly demonstrated a loss of 
galactosyltransferase activity in the mutants. The product of the enzymatic 
reaction had an identical CE mobility compared to a known FCHASE- 
aminophenyl-p-N-acetyllactosamine standard, and was subjected to NMR 
analysis to determine the linkage. The 1 H and 13 C chemical shift data (Table 2) 
and 1D NOE analysis were consistent with the linkage of the Gal being p-1, 4 to 
the GlcNAc. The product was also sensitive to p-galactosidase. 
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Table 2. Linkage analysis of the product formed by HP0826 encoded protein. 

1 H and 13 C chemical shifts of the glycoside of Gal-(J-1 ,4-GicNAc-(3-FEX a 



Residue 


Position 


T T 

H 


c 


A, p-GlcNAc 


1 


4.86 


100.6 


2 


3.91 


55.8 


! 3 


3.72 


73.4 


I 4 


3.72 


79.0 




J 


3.46 


75.8 




A 

KJ 


3.74, 3.83 


60.8 




MAr 


1.91 


22.9 










R ft Hal 


1 

1 


4.46 


103.8 




z 


3.58 


72.0 






3.68 


73.4 




4 


3.94 


69.4 




5 


3.73 


76.3 




6 


3.77, 3.77 


62.0 










FEXas 




3.09 


29.4 


FEXms 




2.80 


36.9 


FEXxs 




3.57 


37.6 


FEXal 




6.92 


118.2 


FEXxl 




7.28 


124.4 


FEXa2 




7.17 


132.5 | 


EXm2 




7.70 


123.3 


FEXx2 




8.00 


121.5 


FEXa3 




7.22 


132.7 


FEXa3' 




7.13 


131.1 


FEXm3 




6.82 


121.5 


FEXx3 




6.91 


104.3 



a in ppm from the 600 MHz HSQC spectrum of the sample in D 2 0 at 35°C. Chemical 
shifts are referenced to the methyl resonance of acetone set at 2.225 ppm for ! H and 
31 .07 ppm 13 C. The error is ± 0.03 ppm for 'H and ± 0.3 for 13 C chemical shifts. 
The AMX spin system of CH 2 -CH 2 -S-CH 2 is at 3.09 t 2.80, 3.57 ppm with Jam=6.4 Hz and 
with their respective 13 C signals at 29.4, 36.9 and 37.6 ppm. The aminophenyl A 2 X 2 spin 
system is at 6.92 and 7.28 ppm with Jax=8.7 Hz and their respective 13 C signals at 1 18.2 
and 124.4 ppm. The three AMX spin system for fluorescein carboxamido group with 
Jam=8-9 Hz and Jmx= 1-2 Hz are at (7.17, 7.70, 8.00), (7.22, 6.82, 6.91) and (7.13, 6.82, 
6.91) ppm. Their respective 13 C signals are at (132.5, 123.3, 121.5), (132.7, 121 5 
104.3) and (131.1, 121.5, 104.3) ppm. 
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Functi nal analysis of rfaJ homologue (HP0159) 

Enzyme activity was detected in extracts of E. coli pHP0159 using the synthetic 
acceptor molecule FCHASE aminophenyl-a-maltose or FCHASE aminophenyl-a- 
glucose and UDP-GIc as the donor. Activity was also present in parent H. pylori 

5 strains but not in H. pylori HP0159 mutants. The assays were followed by TLC 
and CE analysis of the reaction mixtures as previously described (Gilbert et al, 
Eur. J. Biochem. 249: 187-194 (1997)). The more sensitive capillary 
electrophoresis (CE) analysis of the reaction mixtures demonstrated a loss of 
glucosyltransferase activity in the mutants. The product of the enzymatic reaction 

10 was subjected to NMR analysis to determine the linkage (Table 3). The 1 H and 
13 C chemical shift data, and 1 D NOE analysis were consistent with the linkage of 
Glc being a-1,6 to the Glc. 
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Table 3. Linkage analysis of the product formed by HP0159 encoded protein. 
1 H and 13 C chemical shifts of Glc-a-1,6-Glc-a-1,6-Glc-a-FEX a 



Residue 


Position 


H 


C 


A, a-Glc-FEX 


1 


5.35 


98.3 




2 


3.62 


72.1 




3 


3.80 


74.1 




4 


3.48 


70.6 




5 


3.72 


72.1 




6 


3.43, 3.69 


66.5 










B, a-GIc 


1 


4.74 


98.8 




2 


3.47 


72.2 




3 


3.61 


74.3 




4 


3.48 


70.6 




5 


3.73 


71.2 




6 


3.59, 3.87 


66.5 










(terminal^ 

^IW 111. 11.1411 I 


1 


4.89 


98.8 




2 


3.52 


72.5 




3 


3.70 


74.1 




4 


3.41 


70.5 




5 


3.67 


72.8 




6 


3.74,3.79 


61.5 










FEXas 




3.02 


29.3 


FEXms 




2.74 


36.9 


FEXxs 




3.52 


37.5 


FEXal 




7.00 


118.6 


FEXxl 




7.27 


124.2 


FEXa2 




6.92 


131.9 


FEXm2 




7.60 


124.6 


FEXx2 




8.07 


120.7 


FEXa3 




6.95 


132.0 


FEXa3* 




6.92 


131.9 


FEXm3 




6.69 


119.6 


FEXx3 




6.79 


104.1 



a in ppm from the 600 MHz HSQC spectrum of the sample in D 2 0 at 40°C. Chemical 
shifts are referenced to the methyl resonance of acetone set at 2.225 ppm for 'H and 
31 .07 ppm for 13 C. The error is ± 0.03 ppm for 1 H and ± 0.3 for 13 C chemical shifts. 
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Functional analysis of waaF homologu (HP1 1 91 ) 

Complementation analysis was used to determine the function of the HP1191 
from Helicobacter pylori strain 26695. The H. pylori HP1191 gene was amplified 
by PGR (see Table 6 for primer sequences used) and cloned into pUC19 to 
5 obtain pHP1191. WaaF mutant strain S. typhimurium 3789 was electroporated 
with this recombinant plasmid, and one of the resultant transformants selected for 
further study. SDS-PAGE was used to analyze LPS molecules produced by the 
relevant S. typhimurium strains. The LPS of the wild type strain formed the 
ladder like pattern indicative of the presence of the O antigen repeat unit whereas 

io the LPS of the S. typhimurium waaF mutant appeared as a single fast migrating 
band. The migration pattern of this mutant was not affected by the presence of 
the plasmid vector. However, when the H. pylori gene HP1191 was present in 
trans in strain 3789, this S. typhimurium mutant synthesized an LPS which 
migrated in a pattern identical to that obtained with the LPS of the wild type 

15 strain. This confirmed the activity of HP1191 to be involved in catalyzing the 
addition of a second heptose molecule onto the heptose linked directly to KDO in 
the core. 

Construction of H. pylori mutants carrying a disrupted HP0826 gene 

20 In order to determine the role of the HP0826 ORF in LPS biosynthesis, H. pylori 
mutants carrying a disrupted HP0826 gene were constructed by allelic exchange. 
Briefly, the HP0826 ORF cloned in pUC19 was disrupted by using reverse 
primers STAC AG ATCGCTTC ATTG AGTTCT3 ' and 

5'CCAAGAGTTAGGCTATATCCGCTT3' in a PCR reaction and ligating a 

25 kanamycin resistance cassette (or Km r ) to the gel purified product to make 
plasmid pHP0826::kan. H. pylori strains 26695, NCTC11637, 0:3 and Sydney 
strain (SS1) were transformed with plasmid pHP0826::kan DNA following the 
procedure of Haas et al, Mol. Microbiol. 8:753-760 (1993). This construct 
contains 515bp of homologous DNA upstream of the mutation and 464bp 

30 downstream of the mutation. Following transformation, cells were plated on blood 
agar containing kanamycin (20 ^ig/ml). Km r colonies were isolated and passaged 
on the same medium. Individual colonies were selected and screened for the 
presence of a double cross over mutation in the chromosome of the kan mutant. 
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To assess the insertion site of the disrupted gene PCR analysis was us d r with 
chromosomal DNA from parent and mutant H. pylori strains as templates and the 
primer pair 5'ACACTGGCATCATACAAT3' and 

S^CATGCGAAGTrTATGAGCTS' which are internal in the structural gene. This 
5 analysis demonstrated conclusively that the Km r cassette was inserted into the 
chromosomal copy of HP0826. The primer pair amplified the expected 212bp 
fragment in the parent strain, but resulted in a 1 .6kb fragment consistent with 
insertion of the 1 .4kb Km r cassette. Plasmid vector sequences were not detected 
by Southern blotting and a single 1.7kb Hind III fragment corresponding to 

10 insertion of the kan cassette in the HP0826 ORF was present in chromosomal 
DNA's of 26695::0826kan mutant and SS1::0826kan mutant but not in parental 
DNA when probed with the kan cassette. These data confirm that the insertion 
mutant was the result of a double cross-over event. Four kanamycin resistant 
transformants were independently tested to verify that gene disruption and gene 

15 replacement had occurred. All four mutants grew normally in vitro (as assessed 
by OD vs viable numbers) and produced a truncated LPS as assessed by 
electrophoretic mobility on SDS-PAGE gels. The overall protein composition of 
the total membrane fraction was unchanged in the knockout mutants of SS1 and 
26695 as assessed by SDS-PAGE and Coomassie blue staining. The 

20 contribution of polar effects to the phenotype of the HP0826 mutant is highly 
unlikely as a transcriptional terminator lies immediately downstream of the 
HP0826 ORF, the transcriptional organization switches strands and the 
downstream annoted ORF HP0827 is unrelated to LPS biosynthesis. 

25 The construction of H. pylori mutants carrying disrupted HP0159 and HP0479 
genes was carried out in essentially the same manner as above. 

Genomic Organization and Allelic Variation of SS1 

To ascertain if the structural organization found in 26695 and J99 is conserved 
30 within the SS1 genome, PCR amplification and sequencing of the HP0826 
homologue and flanking sequence was obtained from SS1. As with 26695 and 
J99, the upstream and downstream ORFs are conserved although variation in 
the intervening sequence was evident. Allelic variation of SS1 HP0826 resulted 
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in 31 base pair differences between SS1 and 26695 and 46 base pair 
differences between SS1 and J99. These differences in DNA sequence results in 
a total of 9 amino acid changes in the SS1 protein when compared with 26695 
and J99 amino acid sequences. In both comparisons the variations were located 
5 predominately at the N and C terminal region of the protein. 

SDS-PAGE analysis of H. pylori HP0826 mutants 

To characterize the effect of the HP0826 mutation on LPS structure in K pylori, 
proteinase K digested whole cell lysates from both parent and mutant cells grown 

io in broth were analyzed by SDS-PAGE. Silver staining revealed significant 
differences in the electrophoretic mobility of LPS isolated from parent and mutant 
cells of each strain examined. LPS from strains 26695, SS1, 0:3 and 
NCTC11637 appeared to have typical high molecular weight, smooth form LPS 
(S-LPS), while the HP0826 mutant of each strain no longer produced the S-LPS, 

is but appeared to produce a semi-rough type LPS. Immunoblotting with 
monoclonal antibodies to Lewis X (Le x ) and Lewis Y (Le y ) antigens confirmed that 
the LPS from all mutants no longer displayed immunoreactive material of high 
molecular weight typical of the corresponding parental O-chain which displays 
Lewis antigens. 

20 

SDS-PAGE analysis of H. pylori HP0159, 0479 and 1191 mutants 

To characterize the effect of the HP0159, 0479 and 1191 mutations on LPS 
structure in H. pylori, proteinase K digested whole cell lysates from both parent 
and mutant cells grown in broth were analyzed by SDS-PAGE. Silver staining 
25 revealed significant differences in the electrophoretic mobility of LPS isolated 
from parent and mutant cells of each strain examined. In all cases, LPS from 
mutant cells no longer produced S-type LPS but instead only a fast migrating 
rough type LPS was observed. 

30 Structural investigations of H. pylori HP0826 LPS mutants of strains 26695, 
SS1, and NCTC 11637 

The LPS molecules of H. pylori strains 26695, SS1 ( M. A. Monteiro et al, Eur. J. 
Biochem. 267: 305-320 (2000) and type strain NCTC 11637 (Aspinall et a/, 
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supra) have been determined to carry O- chain regions that xpress Le x and Le y 
blood-group determinants. These Lewis-mimicking O chains were shown to be 
covalently connected to a core oligosaccharide. Sugar composition analysis 
(Table 4) of the intact LPSs of K pylon 26695::HP0826kan, SS1::HP0826kan 
and NCTC 11637::HP0826kan demonstrated a clear reduction in levels of those 
sugars known to form the O chain components, namely L-Fuc, D-Gal and D- 
GlcNAc, when compared to parent LPSs. 

Table 4. Approximate molar ratios of the alditol acetate derivatives of 26695, SS1 and 
NCTC 1 1637 HP0826 isogenic mutants intact LPSs. Numbers in parentheses 
indicate ratios obtained for respective parent strains. Analyses performed on 
LPS prepared from broth grown cells. 



Strain 


L-Fuc 


D-GIc 


D-Gal 


GlcNAc 


DD-Hep 


LD-Hep 


26695::Hp0826kan 


0.8 (6) 


6(7) 


1(10) 


1 (8) 


2(2) 


1.8(1.6) 


SS1::Hp0826kan 


0.8 (6) 


2(2) 


1(10) 


K8) 


2(2) 


1.8 (1.6) 


NCTC11637::Hp0826kan 


0.8 (6) 


6(7) 


1 (10) 


1 (8) 


2(2) 


1.8(1.6) 



Methylation linkage analysis performed on the intact H. pylori mutant LPSs from 
each strain showed the presence of terminal and 3-substituted Fuc, terminal, 3-, 
and 6-(except in SS1 strain) substituted Glc, terminal, 3- and 4-substituted Gal, 2- 
(only in 26695), 3-(only in 26695), 6-(only in 26695), 7- and 2,7-substituted DD- 
Hep, 2- and 3,7-substituted LD-Hep, and terminal and 3-substituted GlcNAc 
units. All sugars were present in the pyranose conformation. In order to obtain 
sugar sequence information of the outer-extremities of the LPS molecule (O- 
chain perimeter), a fast atom bombardment-mass spectrometry (FAB-MS) 
experiment in the positive ion mode was carried out on the methylated intact 
mutant LPSs from each strain. The FAB-MS spectra showed several A-type 
primary glycosyl oxonium ions of defined composition. The trace amounts of 
terminal GlcNAc that were observed in the linkage analyses were also detected 
in each of the three mutant LPS FAB-MS spectra at m/z 260 [GlcNAc]* (Fig. 2). 
A-type primary glycosyl oxonium ions containing L wis blood-group related Fuc 
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and GlcNAc residues were observed at m/z 434 [Fuc, GlcNAc]*, 508 [GIcNAc, 
Hep]*, and 682 [Fuc f GlcNAc, Hep]*. The ion m/z 434 stood for a disaccharide 
composed of Fuc and GlcNAc and ion m/z 508 characterized a possible 
connection between the O-chain related GlcNAc and a heptose from the core 

5 region. The ion m/z 682 [Fuc, GlcNAc, Hep] + represented a moiety containing 
GlcNAc and Fuc residues of the O-chain region and a single heptose unit from 
the core region which bridges the O-chain and the core oligosaccharide. Since 
no terminal Hep unit was detected, these m/z 508 and 682 ions must originate 
from cleavage at the heptose glycosidic bond and represent a partial O-chain 

io repeating unit [Fuc, GlcNAc, Hepf . No 3,4-substituted GlcNAc, 2-substituted Gal 
and no m/z 638 (characteristic of Le x ) and 812 (characteristic of Le y ) glycosyl 
oxonium ions were detected, and therefore no evidence of an O-chain containing 
Le x or Le y determinants was found in these analyses of 26695::HP0826kan, 
SS1::HP0826kan and NCTC 11637::HP0826kan LPSs. In addition, higher mass 

is ions in the FAB-MS spectrum of NCTC1t637::HP0826kan at m/z 886 [Fuc, 
GlcNAc, Hep, Glcf, 1090 [Fuc, GlcNAc, Hep, Glc 2 r, and 1294 [Fuc, GlcNAc, 
Hep, Glc 3 ] + (Fig. 2) represented the characteristic glucosylated by a [(1-6)-a- 
glucan] heptose unit (Aspinall et a/, supra) in strain NCTC 1 1637 and 26695 (Fig. 
2). The same primary ions were also observed in the FAB-MS spectrum of the 

20 methylated LPS of 26695::HP0826kan, but not of SS1::HP0826kan, in line with 
the structural findings in the parent strains (M. A. Monteiro, unpublished). In the 
three FAB-MS spectra, the primary ion m/z 668 and its corresponding secondary 
ion m/z 228 (Fig. 2) pointed to the presence of the type 1 linear B blood-group 
[Gal-(1-3)-Gal-(1-3)-GlcNAc] antigen, a blood-group determinant found in the 

25 LPSs of 26695, SS1 (M. A. Monteiro, unpublished), and in NCTC 11637 
(Monteiro ef a/, J. Biol Chem. 273: 11533-11543 (1998)). The glycose units 
emanating from the core oligosaccharide regions were of the same type as those 
found in the respective parent LPSs. The GlcNAc and Fuc units observed were 
remnants of an incomplete O chain. A comparison of the structures identified in 

30 parent and mutant LPS from 26695 and SS1 and the respective HP0826,0159 
and 0479 isogenic mutants is presented in Fig 3. 
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Structural characterization of H. pylori LPS mutants 26695::HP0159kan and 
SS1::HP0159kan 

Growth of bacterial strains and isolation of LPS by hot aqueous phenol method 
5 were carried out as described previously (Logan et a/, Mol. Microbiol. 35: 1156- 
1167 (2000)). Sugar analysis of the intact LPS of H. pylori 26695:: HP0159kan, 
SS1:: HP0159kan f 0:3:: HP0159kan showed significant reduction in L-Fuc, D- 
Gal, and DD-Hep (for serotype 0:3 mutant) when compared with the parent LPS 
indicating the presence of the structure devoid of O-chain and DD-heptan. 

10 Methylation analysis of the intact LPS from each strain showed the presence of 
terminal and 3-substituted L-Fuc, terminal and 4-substituted D-GIc, terminal, 3- 
and 4-substituted D-Gal, terminal, 2-, 6-, 7- and 2,7-substituted DD-Hep, terminal, 
2- and 3-substituted LD-Hep and terminal, 3-substituted and 4-substituted D- 
GlcNAc. All sugars were present in the pyranose form. In addition, methylation 

15 analysis of LPS from 26695::HP0159kan and O:3::HP0159kan revealed the 
presence of 4-substituted D-GIc, no 6-substituted D-GIc was observed. NMR 
analysis of a high molecular mass fraction, isolated by gel filtration 
chromatography from a partially delipidated LPS (1.5% acetic acid, 1h, 100°C) 
from 26695:: HP0159kan by gel filtration chromatography, indicated it to contain 

20 p-1 ,4-linked glucan, a contaminant produced by some strains of H. pylori (Knirel 
et a/, Eur J. Biochem. 266: 123-131 (2000)). In order to deduce the sequence 
information on the outer extremities of the LPS molecule, permethylated intact 
LPS from each strain was subjected to the fast-atom-bombardment mass 
spectrometry analysis in the positive mode. A-type primary glycosyl oxonium 

25 ions containing Lewis blood group related Fuc and GlcNAc residues were 
observed at m/z 260 [GlcNAc]* and m/z 682 [Fuc,GlcNAc, Hep]*. No higher mass 
ions representing a glucosylated DD-heptose unit were detected. This evidence 
together with the absence of 6-substituted glucose in methylation analysis 
indicated this LPS mutant to be deficient in the biosynthesis of <x(1-6)-glucan 

30 present in both 26695 and 0:3 parent strains. Absence of the 3-substituted 
glucose in methylation analysis of LPS from 26695::HP0159kan, 
SS::HP0159kan, suggested that addition of a 1,3-linked glucopyranosyl residue 
was also impaired by this mutation. In the three FAB-MS spectra, the primary ion 
m/z 668 and its corresponding secondary ion m/z 228 pointed to the presence of 
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the type 1 linear B blood group [Gal(1-3)Gal(1-3)GlcNAc] antigen, a blood group 
antigen found in the LPS of 26695 and SS1 (Monteiro et al t Eur. J. Biochem. 
267:305-320 (2000)). Other Lewis blood group-related secondary ions were 
observed at m/z 228 (260-32) [GlcNAc] *. 402 (434-32) [FucGlcNAc]*; 576 (608- 
5 32) [Fuc (1-3)Fuc (1-4)GlcNAcf as previously described (Monteiro et al, J. Biol. 
Chem. 273: 11533-11543 (1998), Logan et a/, Mol. Microbiol. 35: 1156-1167 
(2000)). 

LPS from 26695::HP0159kan was treated with 0.1 M sodium acetate buffer, pH 
10 4.2 (2 h, 100°C) and following the removal of lipid A by low speed centrifugation, 
subjected to the gel filtration chromatography on a Bio-Gel P-2 column, followed 
by capillary, electrophoresis-electrospray mass spectrometry (CE-ESMS) as 
described previously (Thibault and Richards, Meth. Mol. Biol. 145: 327-343 
(2000)). The CE-ESMS spectrum of the delipidated LPS confirmed the presence 
15 of a major glycoform produced by the 26695::HP0159 mutant LPS, 
corresponding to FucGlcNAcHex 2 Hep 4 (PE)KDO (m/z 902, doubly protonated 
ion). MS-MS of the doubly charged ion (m/z 902) (Fig. 4) afforded a singly 
charged fragment at m/z 1601 consistent with the loss of GlcNAc (and its 
anhydro form at m/z 1583) which subsequently lost Fuc and Hep residues to 
20 afford a fragment ion at m/z 1262. A comparison of the structures identified in 
parent and HP0159 mutant LPS is presented in Fig. 3. 

Structural characterization of H. pylori LPS mutants 26695::HP0479kan and 
SS1::HP0479kan. 

25 

Sugar analysis of the HP0479 LPS mutants indicated reduction in the amount of 
L-Fuc, D-Gal and DD-Hep and methylation analysis confirmed this. Methylation 
analysis of the intact LPS from each strain indicated absence of 3-substituted and 
6-substituted D-GIc, 3-substituted DD-Hep and 6-substituted DD-Hep (for 
30 O:3::HP0479 and 26695::HP0479 LPS) and a significant decrease in 2- 
substituted DD-Hep, suggesting deficiencies in the core biosynthesis. 

FAB-MS analysis in the positive mode of the permethylated LPS from each strain 
indicated the presence of primary glycosyl oxonium ions at m/z 260 [GlcNAc]* 
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and m/z 434 [Fuc,GlcNAc]* and secondary glycosyl oxonium ions at m/z 228 
(260-32) [GlcNAc]* and m/z 402 (434-32) [Fuc,GlcNAc] + . This evidence together 
with the absence of the primary glycosyl oxonium ion at m/z 682 [Fuc, GlcNAc, 
Hep] + suggested that the mutant LPS structure was lacking DD-Hep residue 
5 which bridges O-chain and the core oligosaccharide in the respective parent LPS 
(Monteiro et a/, Eur. J. Biochem. 267: 305-320 (2000), Logan et a/, MoL 
Microbiol. 35: 1168-1179 (2000)). LPS from SS1:: HP0479 and 26695 was 
delipidated and desalted following gel filtration chromatography on a Bio-Gel P-2 
column. Fractions containing core oligosaccharide components were subjected to 

10 the mass spectrometry analysis using combined capillary zone electrophoresis- 
electrospray-mass spectrometry (CZE-ESMS) in the positive mode, followed by 
MS/MS analysis of the most abundant oligosaccharide fragments. The product 
ion spectrum showed two major singly charged fragment ions at m/z 1612 and 
m/z 1246, containing an anhydro-KDO. The fragment ion at m/z 1612 could be 

15 assigned to the glycoform FucGlcNAcHex 2 Hep 3 (PE)KDO (Fig. 5), based on the 
linkage and FAB-MS analyses data and recent structural studies (Monteiro et al, 
Eur. J. Biochem. 267: 305-320 (2000)). The MS/MS spectrum of m/z 1246 was 
consistent with the core fragment Hex 2 Hep3(PE)KDO as confirmed by a 
consecutive cleavage of glycosidic bonds yielding a direct sequence assignment. 

20 These structural assignments are consistent with the presence of 2,7-substituted 
DD-Hep, 7-substituted DD-Hep and 2-substituted DD-Hep in the methylation 
analysis of LPS mutants 26695::HP0479kan, SS1::HP0479kan, O:3::HP0479kan. 
Absence of the first DD-Hep which serves as a link between the O-chain and the 
core oligosaccharide and is glycosylated by 1,6-glucan, resulted in the loss of O- 

25 chain and DD-heptan (for serotype 0:3). A comparison of the structures identified 
in parent and HP0479 mutant LPS is presented in Fig. 3. 

Mouse Colonization Studies 

The role of S-type LPS in gastric colonisation was investigated using the SS1 
30 strain of H. pylori which others (Lee ef a/, Gastroenterology 112: 1386-1397 
(1997); Ferrero et a/, Infect Immun. 66: 1349-1355 (1998); Conlan et at, Can. J. 
Microbiol. 45:975-980 (1999)) have shown to be capable of colonising the 
stomachs of mice, including the CD1 strain used in the present study. Both 
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parental SS1 and SS1 HP0826 mutant which was obtained by natural 
transformation w re used to orogastrically inoculate mice. The parent SS1 cells 
produce considerable amounts of S type LPS displaying Lewis Y epitopes while 
cells in which HP0826 has been inactivated produce a faster migrating, rough 
5 type LPS molecule no longer displaying Lewis epitopes. To minimise the 
likelihood that any observed differences in in vivo behaviour arose as a result of 
exogenous influences, care was taken to ensure that the mutant and parental 
strains underwent equivalent in vitro manipulations before being gavaged into 
mice. In an initial experiment, groups of mice were gavaged with either wild-type 

10 or mutated H. pylori SS1. Representative mice from each group were killed 6 or 
12 weeks later and the stomach burdens of H. pylori, and level of Helicobacter- 
specific circulating immunoglobulin G determined. By 6 weeks of infection, 5.65 
+/- 0.26 logioCFU (colony-forming units) of wild-type bacteria were recovered 
from the stomachs of mice (n=4) challenged with this organism, whereas only 

is 4.27+/- 0.26 logioCFU of the mutant bacteria were recovered from the stomachs 
of mice gavaged with it. This 24-fold decreased recovery of mutant versus wild- 
type H. pylori SS1 was statistically significant according to the Mann-Whitney 
Rank Sum Test (p<0.05). Similarly, by 12 weeks there was a 10-fold difference 
in numbers of wild-type (5.81+/-0.51 logioCFU, n=5) and mutant (4.79+/-0.43 

20 logioCFU, n=5) bacteria recovered, and this too was statistically significant 
(p<0.05). PCR performed on digested stomach tissue confirmed the above 
findings, indicating that the decreased recovery was not due to any innate 
unculturability of the mutant bacteria. Likewise, by 12 weeks of infection sera 
from mice infected with wild-type SS1 all reacted by ELISA against a sonicate of 

25 H. pylori as coating antigen (average IgG titre = 1270+/-2166) whereas only 3/5 
mice infected with mutant SS1 had seroconverted (mean IgG titre of 
seropositives = 123+/-94). Additionally, when either parental or mutant LPS was 
used as the coating antigen in ELISA, only mice infected with the parental strain 
of H. pylori showed evidence of seroconversion. 

30 

To determine whether the colonisation differences observed in the 
aforementioned experiment were due to an initial inability of the mutant strain to 
colonise or due to its subsequent elimination, a complementary experiment 
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examined gastric colonization levels of parental and mutated H. pylori SS1 at 1 
and 3 weeks post-challenge. By one-week post-challenge, 5.81+/-0.29 log 10 CFU 
(n=5) of wild-type bacteria, but only 3.94+/-0.33 log 10 CFU (n=5) of the mutant 
bacteria were recovered from the stomachs of the respectively infected mice. 
5 This 74-fold difference was statistically significant (P< 0.05) and convincingly 
shows that H. pylori SS1 bacteria unable to produce S-type LPS are significantly 
impaired in their ability to initially colonise the murine stomach. In this experiment, 
approximately 17-fold more wild-type than mutant H. pylori (5.4+/-0.34 log 10 CFU, 
n=5 versus 4.18+/-0.14 logioCFU, n=5) were recovered from the stomachs of 
10 relevant mice at three weeks of infection. 

Results of mouse colonization experiments for the parent (SS1) strain of H. pylori 
and their mutant strains SS0826, SS0159 and SS0479 are summarized in Table 
5. 

15 

Table 5. Mouse colonization data. Numbers in the table show levels of colonization of c 
mice stomachs (as logioCFU/stomach +/- standard deviation) after the indicated 
number of weeks (WK) of infection. ND: not determined BDL: less than 500 bacteria 

20 





STRAIN 


WK 1 


WK 3 


WK 6 


WK12 


WK 20 


EXP 1 


SS1 


5.81 +/-0.29 
(n = 5) 


5.40 +/- 0.34 
(n = 5) 


5.65 +/- 0.26 

(n = 4) 


5.81 +/-0.51 

(n = 5) 


ND 




SS0826 


3.94 +/- 0.33 
(n = 5) 


4.18 +/-0.17 
(n = 5) 


4.27 +/- 0.26 
(n = 4) 


4.79+/- 0.43 
(n = 5) 


ND 


EXP 2 


SS1 


5.43 +/- 0.03 

(n = 4) 


ND 


ND 


5.94 +/- 0.33 
(n = 5) 


5.84 +/-1.10 

(n = 5) 




SS0159 


3.37 +/- 0.20 

(n = 4) 


ND 


ND 


3.09 +/- 0.42 
(n = 5) 


< 3.76 

(n = 5) 


EXP 3 


SS1 


4.76 +/- 0.93 

(n = 5) 


ND 


ND 


5.02+/- 1.06 
(n = 5) 


ND 




SS0479 


BDL 

(n = 5) 


ND 


ND 


BDL 

(n=5) 


ND 



38 



WO 01/00796 



PCT/CA00/00777 



Exp 1 : Individual mice inoculated by gavage on D1 , D3, D6 with 0.2ml of 
broth grown cells suspended in PBS at cell concentration of -1 x 
10 10 /ml. 

5 Exp 2: Individual mice inoculated by gavage on D1 + D3 with 0.2ml of 
broth grown cells suspended in PBS at cell concentration of -2 x 
10 10 /ml. 

Exp 3: Individual mice inoculated by gavage on D1 and D3 with 0.2ml of 
10 broth grown cells suspended in PBS at cell concentration of 

4.7x1 0 r °/ml (D1) and 1x10 7 /ml (D3) 



The above data show that all the mutants with disrupted genes have a reduced 
is ability to colonize the murine stomach, as compared with the parent strain. 
SS0479 strain (H. pylori strain SS1 having disrupted gene HP0479) is the least 
capable of colonization. 



20 EXPERIMENTAL 

Bacterial strains and culture conditions 

Helicobacter pylori strain 26695 (Tomb et al, supra) used for the initial cloning 
was obtained from R. A. Aim, Astra, Boston. H. pylori strain SS1 was obtained 

25 from A. Lee. H. pylori reference strain ATCC43504 and H. pylori serogroup 0:3 
isolate were from J. Penner. PJ1 was a fresh clinical isolate of H. pylori. 
Helicobacter strains were grown on at 37°C on antibiotic supplemented (Lee et a/, 
supra) trypticase soy agar plates containing 7% horse blood (GSS agar) in a 
microaerophilic environment for 48h (Kan 20 ^g/ml). For growth in liquid culture, 

30 antibiotic supplemented Brucella broth containing 5% fetal bovine serum, was 
inoculated with H. pylori cells harvested from 48h trypticase soy agar/horse blood 
plates and incubated for 36h in a Trigas (Nuaire, Plymouth, MN) incubator (85% 
N 2 , 10%CO 2 , 5%0 2 ) on a shaking platform. Escherichia coli strain DH5a was 
used as host for plasmid cloning experiments and was grown on L-agar plates at 

35 37°C supplemented with ampicillin (50jigmr 1 ) and/or kanamycin (20jxgmr 1 ) 
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P-1,4-galactosy (transferase activity 

Glycosyltransferase assays were performed essentially as described previously 
(Gilbert et a/. f supra). Cells were scraped from a 3 day old plate culture of H. 
pylori, the cells were stored frozen at -20°C. Cell extracts were made by mixing 
5 the cell pellet with 2 volumes of glass beads, and grinding with a ground glass 
pestle in the microcentrifuge tube. The paste was extracted twice with 50 pi of 50 
mM MOPS-NaOH buffer pH 7.0. Reactions contained 0.5 mM FCHASE- 
aminophenyl-p-GlcNAc, 10 mM MnCI 2 , 0.5 mM UDP-Gal, 50 mM MOPS-NaOH 
pH 7.0, and 10 pi of cell extract in a final volume of 20 pi. For reactions with the 

10 cell extracts of H. pylori the reactions were incubated 3-5 h at 37°C, whereas 
with the extracts containing the recombinant enzyme the reactions times were 30 
- 60 min at 37°C. The TLC and CE analysis was performed as previously 
described (Gilbert et a/. t supra). For TLC analysis 0.5 pi of the reaction mixture 
were spotted and developed and for CE analysis samples were diluted to an 

15 FCHASE-aminophenyl-p-GlcNAc concentration of 10 pM prior to analysis. 

Recombinant DNA techniques and nucleotide sequence analysis 

DNA sequencing of PCR products was performed using an Applied Biosystems 
(model 370A) automated DNA sequencer using the manufacturers cycle 

20 sequencing kit. All standard methods of DNA manipulation were performed 
according to the protocols of Sambrook et a/, Molecular Cloning: A Laboratory 
Manual, 2 nd edn. Cold Spring Harbor, New York: Cold Spring Harbor Laboratory 
Press (1989). DNA probes for Southern blotting were labelled with DIG-11-dUTP 
using DIG-High Prime (Boehringer Mannheim, Montreal, Canada) and detection 

25 of hybridized probe with DIG Luminescent Detection Kit (Boehringer Mannheim 
Montreal, Canada). Primers used for the PCR gene amplification and for mutant 
constructs are shown in Table 6. 
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Table 6. Primer sequences for PCR amplification of HP0826, HP0159, HP0479 and 
HP1 1 91 genes and for construction of respective mutant strains . 



Primer 


Primer (5'-3* Iseauence 


HP0826-F1 


cggatccGGTTTTTATAGCCATGATGC 


HP0826-R1 


rnn atrrA A dCX PP PTT A A r^TTTT^TTP 


HP0826-mut1 

i in UU^U*I l IUI 1 


TAP A G> ATPfi CTTC A TTH A CTTPT 


I 1 » UO£U 1 1 IU \£- 


P P A A A fiTT AH P PT AT A TP m PTT 


HP0159-F1 


cgggatccTGTCAAATTCGCCTATAGCGT 


nrU lOy-Kl 


cgggatccAou i a 1 1 i i AovjoAAACCGCT 


HPD1 SQ-mut1 


pppppp i i i i iaptpptpaat 


HP0159-mut2 


AGGGAAAAGGCTTGACGAGG 


HP0479-F1 


GCCTTTATCAAGCTAGAG 


HP0479-R1 


CATAAATGTCCTAACAAGC 


HP0479-mutF1 


CAAAACCGCCAGGAGTTG 


HP0479-mutR1 


GGTTATGGGAATGAATTTGG 


HP1191-F1 


cgggatccCGGTCTTTAAACCCGCTCAACA 


HP1191-R1 


cgggatccCCGCTCTTCTCACGCCTTTAA 



Site specific mutagenesis of HP0826 

HP0826 clone of Helicobacter pylori strain 26695 was mutagenized in E. coli by 
ligation of the Km r cassette described by Labigne et al (J. Bacteriol. 170: 1704- 
1708 (1988)) to pUC19 containing the HP0826 gene. Deletion of a central 66 bp 
region of the gene was achieved by reverse PCR (Pwo polymerase, Boehringer 
Mannheim) using the outward primers 5TACAGATCGCTTCATTGAGTTCT3' and 
5'CCAAGAGTTAGGCTATATCCGCTT3' followed by blunt end ligation with the 
Km r cassette. The mutated allele was returned to Helicobacter by natural 
transformation according to the method of Haas et al (supra). 

Electrophoresis and Western blotting 

SDS-PAGE was performed with a mini-slab gel apparatus (Biorad) by the method 
of Laemmli (Nature 227: 680-685 (1970)). LPS samples were prepared from 
whole cells according to a previously described method (Logan et al, Infect 
Immun. 45: 210-216 (1984)), equivalent amounts loaded in each lane and 
stained according to Tsai et al (Anal. Biochem. 119: 115-119 (1982)) or 
transferred to nitrocellulose for immunological detection as previously described 
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(Logan et al, supra). Anti Lewis monoclonal antibodies (Signet Laboratories Inc, 
Dedham, MA) were used at 1:500 dilution. 

Isolation of membrane fraction 

5 Broth grown cells (18h) were harvested and resuspended in 20mM Tris (pH 7.4). 
Following sonication (3x60sec) intact cells were removed by centrifugation at 
4000xg, and membranes sedimented by centrifugation at 40,000xg, washed in 
20mM Tris (pH7.4) recentrifuged, and resuspended in 0.5m! 20mM Tris (pH7.4). 
Equivalent amounts of SS1, 26695 parent and mutant strains were analyzed by 
10 SDS-PAGE and stained by Coomassie Blue. 

Isolation of Lipopoly saccharides 

The LPSs were isolated by the hot phenol-water extraction procedure (Westphal 
etal, Meth. Carbohydr. Chem. 5: 83-91 (1965)). The LPSs were purified by gel- 
is permeation-chromatography on a column of Bio-Gel P-2 (1m x 1cm) with water 
as eluent. In all cases, only one carbohydrate positive fraction was obtained 
which eluted in the high M r range (Dubois et al, AnaL Chem. 28: 350-356 (1956)). 
These intact H. pylori LPSs then were used for chemical analyses. 

20 Sugar Composition and Methylation Linkage Analyses 

Sugar composition analysis was performed by the alditol acetate method 
(Sawardeker et al, AnaL Chem. 39:1602-1604 (1967)). The hydrolysis was done 
in 4M trifluoroacetic acid at 100°C for 4h or 2M trifluoroacetic acid at 100°C for 
16h followed by reduction in H 2 0 with NaBD*. and subsequent acetylation with 

25 acetic anhydride and with residual sodium acetate as the catalyst. Alditol acetate 
derivatives were analyzed by gas-liquid-chromatography mass-spectrometry 
(GLC-MS) using a Hewlett-Packard chromatograph equipped with a 30 m DB-17 
capillary cplumn [210°C (30 min) to 240°C at 2°C/min] and MS in the electron 
impact (El) mode was recorded using a Varian Saturn II mass spectrometer. 

30 Methylation linkage analysis was carried out by the NaOH/DMSO/CH 3 l procedure 
(Ciucanu et al, Carbohydr. Res. 131: 209-217 (1984)) and with characterization 
of permethylated alditol acetate derivatives by GLC-MS in the El mode (DB-17 
column, isothermally at 190°C for 60 min). 
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Fast Atom Bombardment-Mass Spectrometry (FAB-MS) 

A fraction of the methylated sample was used for positive ion fast atom 
bombardment-mass spectrometry (FAB-MS) which was performed on a Jeol 
5 JMS-AX505H mass spectrometer with glycerol(1) : thioglycerol(3) as the matrix. 
A 6 kV Xenon beam was used to produce pseudo molecular ions which were 
then accelerated to 3kV and their mass analyzed. Product ion scan (B/E) and 
precursor ion scan (B 2 /E) were preformed on metastable ions created in the first 
free field with a source pressure of 5x1 0" 5 torr. The interpretations of positive ion 
10 mass spectra of the permethylated LPS derivatives were as previously described 
by Dell ef a/ (Carbohydr. Res. 200: 59-67 (1990). 

Electrospray mass spectrometry 

Samples were analyzed on a crystal Model 310 CE instrument (ATI Unicam, 

15 Boston, MA, USA) coupled to an API 3000 mass spectrometer (Perkin- 
Elmer/Sciex, Concord, Canada) via a microlonspray interface. A sheath solution 
(isopropanol-methanol, 2:1) was delivered at a flow rate of 1 ^L/min to a low dead 
volume tee (250 \im i.d., Chromatographic Specialties, Brockville, Canada). All 
aqueous solutions were filtered through a 0.45-jim filter (Millipore, Bedford, MA, 

20 USA) before use. An electrospray stainless steel needle (27 gauge) was butted 
against the low dead volume tee and enabled the delivery of the sheath solution 
to the end of the capillary column. The separation were obtained on about 90 cm 
length bare fused-silica capillary using 10 mM ammonium acetate/ammonium 
hydroxide in deionized waster, pH 9.0, containing 5% methanol. A voltage of 25 

25 kV was typically applied at the injection. The outlet of the capillary was tapered to 
ca. 15 ^m i.d. using a laser puller (Sutter Instruments, Novato, CA, USA). Mass 
spectra were acquired with dwell times of 3.0 ms per step of 1 m/z unit in full- 
mass-scan mode. For CZE-ES-MS/MS experiments, about 30 nL sample was 
introduced using 300 mbarfor 0.1 min. The MS/MS data were acquired with dwell 

30 times of 1.0 ms per step of 1 m/z unit. Fragment ions formed by collision 
activation of selected precursor ions with nitrogen in the RF-only quadrupole 
collision cell, were mass-analyzed by scanning the third quadrupole. Collision 
energies were typically 60 eV (laboratory frame of reference). 
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M use Colonization 

Specific Pathogen free Female CD1 mice were purchased from Charles Rivers 
Laboratories, Montreal when they were 6-8 weeks old. Mice were maintained 
5 and used in accordance with the recommendations of the Canadian Council on 
Animal Care, Guide to the Care and Use of Experimental Animals (1993). Mice 
were inoculated with bacteria harvested from 36h broth culture. Aliquots of 0.2 
ml, containing approximately 10 8 bacteria resuspended in PBS were given by 
gavage directly into the gastric lumen using a 20g gavage needle. Three inocula 

10 were given over a period of 6 days. No attempt was made to neutralize gastric 
acidity prior to inoculation. To recover viable bacteria from the stomach, mice 
were killed by CO2 asphyxiation, and their stomachs removed whole. Stomachs 
were cut open along the greater curvature, and the exposed lumenal surface was 
gently irrigated with 10 ml of sterile PBS, delivered via a syringe fitted with a 20g 

15 gavage needle, to dislodge the loosely adherent stomach contents. This step 
effectively diminished the small numbers of ubiquitous contaminating bacteria 
that otherwise overgrow on GSS agar to thereby mask the presence of the slower 
growing H. pylori organisms. The washed stomach tissue was then 
homogenised, and serial dilutions plated on GSS agar. H. pylori colonies were 

20 counted following 3-6 days incubation. 

Detection of ft pylori specific antibodies by ELISA 

Sera for antibody determinations were prepared from clotted blood obtained from 
a lateral tail vein during the course of an experiment or by cardiac puncture at the 

25 time of necropsy. Sera were screened for the presence of specific IgG isotype 
anti- H. pylori antibodies by ELISA essentially by the method of Engvall et al (J. 
Immunol. 109: 129-135 (1972)). Briefly, microtitre plates (Dynatech Immunolon 
II) were coated with 100 ^l antigen (50 ng protein/ml in 0.05M carbonate buffer 
pH 9.8) and incubated overnight at 4°C. Antigen was prepared by resuspending 

30 plate grown H. pylori in PBS and sonicating the suspension until a translucent 
solution was obtained. The sonicate was membrane filter sterilized through a 
0.45 (im filter. The protein content of the filtrate was determined by Lowry assay 
using a commercial kit. Sodium azide was added to 0.05% w/v and the antigen 
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solution was stored at 4°C. When LPS was used as the coating antigen th 
concentration was 10jig/ml. Sera were screened at a starting dilution of 1/40 and 
were titrated through a two-fold dilution series down a column of 8 wells. The 
developing antibody was goat-anti-mouse IgG conjugated to alkaline 
phosphatase (Caltag Laboratories). Titres were determined from plots of 
absorbance at 410 nm versus dilution and were defined as the reciprocal of the 
dilution giving an A^o equivalent to 0.25. Standard negative and positive control 
sera identified by a preliminary ELISA of candidate samples were included on 
each plate. Titres were analysed statistically by Mann Whitney Rank Sum Test 
and were considered to be significantly different to comparative samples when p 
values <0.05 were obtained. 

Although various particular embodiments of the present invention have been 
described hereinbefore for purposes of illustration, it would be apparent to those 
skilled in the art that numerous variations may be made thereto without departing 
from the spirit and scope of the invention, as defined in the appended claims. 
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WHAT IS CLAIMED IS: 

1 . An isolated or recombinant polynucleotide encoding at least a portion of a 
Helicobacter glycosyltransferase involved in the biosynthesis of a 
Helicobacter lipopolysaccharide (LPS). 

2. A polynucleotide according to claim t, wherein the glycosyltransferase is 
involved in the biosynthesis of the O-chain region of the LPS. 

3. A polynucleotide according to claim 1, wherein the glycosyltransferase is 
involved in the biosynthesis of the core region of the LPS. 

4. A polynucleotide according to claim 2 t wherein the glycosyltransferase is a 
galactosyltransferase. 

5. A polynucleotide according to claim 4, wherein the galactosyltransferase is 
a p-1 ,4-galactosyltransferase. 

6. A polynucleotide according to claim 5, wherein the Helicobacter is a strain 
of H. pylori. 

7. A polynucleotide according to claim 3, wherein the glycosyltransferase is a 
glucosyltransferase. 

8. A polynucleotide according to claim 7, wherein the glycosyltransferase is an 
a-1 ,6-glucosyltransferase. 

9. A polynucleotide according to claim 8, wherein the Helicobacter is a strain 
of H. pylori. 

10. A polynucleotide according to claim 3, wherein the glycosyltransferase is a 
heptosyltranferase. 
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1 1 . A polynucleotide according to claim 10, wherein the heptosyltransferase is 
an ADP-heptose-LPS heptosyltransferase II. 

12. A polynucleotide according to claim 11, wherein the Helicobacter is a 
5 strain of H. pylori. 

13. An isolated or recombinant polynucleotide having sequence selected from 
the group consisting of SEQ ID NO:1, SEQ ID NO:3, SEQ ID NO:5 f SEQ ID 
NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:15 and 

io fragments and variants thereof. 

14. An isolated or recombinant polynucleotide having at least about 70% 
identity to the polynucleotide according to claim 13. 

15 15. An isolated or recombinant^ produced polypeptide comprising at least a 
portion of a Helicobacter glycosyltransferase involved in the biosynthesis of a 
Helicobacter lipopolysaccharide (LPS). 

16. A polypeptide according to claim 15, wherein the glycosyltransferase is 
20 involved in the biosynthesis of the O-chain region of the LPS. 

17. A polypeptide according to claim 15, wherein the glucosyltransferase is 
involved in the biosynthesis of the core region of the LPS. 

25 18. A polypeptide according to claim 16, wherein the glycosyltransferase is a 
galactosyltransferase. 

19. A polypeptide according to claim 18, wherein the galactosyltransferase is 
a p-1 ,4-galactosyltransferase. 

30 

20. A polypeptide according to claim 19, wherein the Helicobacter is a strain 
of H pylori. 
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21. A polypeptide according to claim 17, wherein the glycosyltransferase is a 
glucosyltransferase. 

22. A polypeptide according to claim 21, wherein the glucosyltransferase is an 
5 ot-1 ,6-glucosyltransferase. 

23. A polypeptide according to claim 22, wherein the Helicobacter is a strain 
of H. pylori. 

10 24. A polypeptide according to claim 17, wherein the glycosyltransferase is a 
heptosyltranferase. 

25. A polypeptide according to claim 24, wherein the heptosyltransferase is an 
ADP-heptose-LPS heptosyltransferase II. 

15 

26. A polypeptide according to claim 24, wherein the Helicobacter is a strain 
of H. pylori. 

27. An isolated or recombinantly produced polypeptide having sequence 
20 selected from the group consisting of SEQ ID NO:2, SEQ ID NO:4, SEQ ID 

NO:6, SEQ ID NO:8, SEQ ID NO:10, SEQ ID NO:12, SEQ ID NO:14, SEQ ID 
NO: 1 6 and fragments and variants thereof. 

28. An isolated or recombinantly produced polypeptide having at least about 
25 50% identity to the isolated polypeptide according to claim 27. 

29. A recombinant vector comprising a nucleic acid encoding at least a portion 
of a Helicobacter glycosyltransferase involved in the biosynthesis of a 
Helicobacter lipopolysaccharide (LPS). 

30 

30. A recombinant vector according to claim 29, wherein the 
glycosyltransferase is involved in the biosynthesis of the O-chain region of the 
LPS. 
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31. A recombinant vector according to claim 29, wherein the 
glycosyltransferase is involved in the biosynthesis of the core region of the 
LPS. 

5 

32. A recombinant vector according to claim 30, wherein the 
glycosyltransferase is a galactosyltransferase. 

33. A recombinant vector according to claim 32, wherein the 
10 galactosyltransferase is a p-1 ,4-galactosyltransferase. 

34. A recombinant vector according to claim 33, wherein the Helicobacter \s a 
strain of H. pylori. 

15 35. A recombinant vector according to claim 31, wherein the 
glycosyltransferase is a glucosyltransferase. 

36. A recombinant vector according to claim 35, wherein the 
glucosyltransferase is an a-1 ,6-glucosyltransferase. 

20 

37. A recombinant vector according to claim 36, wherein the Helicobacter is a 
strain of H. pylori. 

38. A recombinant vector according to claim 31, wherein the 
25 glycosyltransferase is a heptosyltranferase. 

39. A recombinant vector according to claim 38, wherein the 
heptosyltransferase is an ADP-heptose-LPS heptosyltransferase II. 

30 40. A recombinant vector according to claim 39, wherein the Helicobacter is a 
strain of H. pylori. 
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41. A recombinant vector according to claim 29, wherein the 
giycosyltransferase has a sequence selected from the group consisting of 
SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, 
SEQ ID NO:12, SEQ ID NO:14, SEQ ID NO:16 and fragments and variants 

5 thereof. 

42. An expression cassette that comprises a nucleic acid encoding at least a 
portion of a Helicobacter giycosyltransferase involved in the biosynthesis of a 
Helicobacter lipopolysaccharide (LPS). 

10 

43. An expression cassette according to claim 42 t wherein the 
giycosyltransferase is involved in the biosynthesis of the O-chain region of the 
LPS. 

is 44. An expression cassette according to claim 42, wherein the 
giycosyltransferase is involved in the biosynthesis of the core region of the 
LPS. 

45. An expression cassette according to claim 42, wherein the 
20 giycosyltransferase has a sequence selected from the group consisting of 
SEQ ID NO:2, SEQ ID NO:4, SEQ ID NO:6, SEQ ID NO:8, SEQ ID NO:10, 
SEQ ID NO:t2, SEQ ID NO:14, SEQ ID NO:16 and fragments and variants 
thereof. 

25 46. A host cell comprising a recombinant nucleic acid which can express a 
protein encoding at least a portion of a Helicobacter giycosyltransferase 
involved in the biosynthesis of a Helicobacter lipopolysaccharide (LPS). 

47. A host cell according to claim 46, wherein the giycosyltransferase is 
30 involved in the biosynthesis of the O-chain region of the LPS. 

48. A host cell according to claim 46, wherein the giycosyltransferase is 
involved in the biosynthesis of the core region of the LPS. 
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49. A host cell according to claim 46, wherein the glycosyltransferase has a 
sequence selected from th group consisting of SEQ ID NO:1, SEQ ID NO:3, 
SEQ ID NO:5. SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:13, 

5 SEQ ID NO:1 5 and fragments and variants thereof. 

50. A host cell according to claim 49, wherein the cell is a eukaiyotic cell. 

51. A host cell according to claim 49, wherein the cell is a prokaryotic cell. 

52. A host cell according to claim 51 , wherein the prokaryotic cell is a cell of 
E. coti. 

53. A method for producing a polypeptide comprising at least a portion of a 
is Helicobacter glycosyltransferase involved in the biosynthesis of a 

Helicobacter lipopolysaccharide (LPS), comprising the steps of maintaining a 
host cell of claim 46 under conditions suitable for expression of said 
polypeptide and recovering the polypeptide so produced. 

20 54. A method according to claim 53, wherein the glycosyltransferase has a 
sequence selected from the group consisting of SEQ ID NO:1, SEQ ID NO:3, 
SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, SEQ ID NO:11, SEQ ID NO:13. 
SEQ ID NO:15 and fragments and variants thereof. 

25 55. A method according to claim 53, further including the step of purifying the 
recovered polypeptide. 

56. A hybridization probe comprising a portion of a polynucleotide encoding a 
Helicobacter glycosyltransferase involved in the biosynthesis of a 

30 Helicobacter lipopolysaccharide (LPS). 

57. A hybridization probe according to claim 56, wherein the 
glycosyltransferase has a sequence selected from the group consisting of 
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SEQ ID NO:1. SEQ ID NO:3, SEQ ID NO:5, SEQ ID NO:7, SEQ ID NO:9, 
SEQ ID NO:11, SEQ ID NO:13, SEQ ID NO:15 and fragments and variants 
thereof. 

5 58. A hybridization probe according to claim 57, wherein the probe comprises 
at least about 15 nucleotides. 

59. A mutant strain of H. pylori, said mutant strain having deactivated at least 
one gene encoding a glycosyltransferase involved in the biosynthesis of a H. 

10 pylori lipopolysaccharide (LPS). 

60. A mutant strain according to claim 59, wherein the glycosyltransferase is 
involved in the biosynthesis of the O-chain of the LPS. 

is 61. A mutant strain according to claim 59, wherein the glycosyltransferase is 
involved in the biosynthesis of the core region of LPS. 

62. A mutant according to claim 59, wherein the glycosyltransferase is coded 
by open reading frames 0826, 0159, 0479 or 1 191 . 

20 

63. A vaccine composition comprising an antigen derived from a mutant strain 
of H. pylori according to claim 59. 

64. A vaccine composition according to claim 63, wherein the antigen is an at 
25 least partially purified lipopolysaccharide. 

65. A vaccine composition according to claim 64, wherein the antigen is 
conjugated to a protein. 

30 66. A live attenuated vaccine composition comprising a mutant strain of H. 
pylori according to claim 59. 
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67. A reaction mixture for an enzymatic synthesis of a Helicobacter 
lipopolysaccharide or a portion thereof, th mixture comprising an isolated 
polypeptide having activity of a Helicobacter glycosyltransferase involved in 
the biosynthesis of a Helicobacter lipopolysaccharide (LPS). 

5 

68. A reaction mixture according to claim 67, wherein the glycosyltransferase 
is involved in the biosynthesis of the O-chain region of the Helicobacter 
lipopolysaccharide. 

10 69. A reaction mixture according to claim 67, wherein the glycosyltransferase 
is involved in the biosynthesis of the core region of the Helicobacter 
lipopolysaccharide. 

70. A reaction mixture according to claim 66, wherein the bacterial 
15 lipopolysaccharide is a mimic of a Helicobacter lipopolysaccharide. 
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